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Errata (Annals Vol. Ill, No. 1) 


Page 46, Equation (3) should read 


y- 
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o&W 


f Me 


Zct z 
instead of 


y oHW * (x)e 


-[/ M-Mf 


Page 50, Table I. For the first step-interval the constant 

k should be negative, that is, -.30102 99956 
For step-interval 9, k should be .65321 
25137 instead of .64321 25137. 


Page 51. the equation should read 



F log 2 x c z 
A/ 

instead of 


CL -c z 

9 J N 

Page 52, first line, should read F togx's instead of F/ag z x‘s 

Page 55, eighth line, should read “of the number and r is the 
remainder” 

Instead of “of the number and is the remainder.” 




DISTRIBUTION OF THE MEANS DIVIDED BY THE 
STANDARD DEVIATIONS OF SAMPLES FROM 
NON-HOMOGENEOUS POPULATIONS 

By 

G. A. Baker 


In a previous paper 1 2 * the distributions of the means and vari¬ 
ances, means squared and variances of samples of two drawn from 
a non-homogeneous population composed of two normal popula¬ 
tions have been discussed. It is the purpose of this paper to dis¬ 
cuss similarly the distribution of the means of samples of two 
measured from the mean of the population divided by the stand¬ 
ard deviations of the samples for such parent populations and to 
present experimental results for samples of four. 

CASE 77 = 2 

Suppose that a population represented by 


(l) 


Kx)- 


N, 


i (z±rnjf 
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1 (x- m z Y 


N, 


cf,JZrr 




-N,m, + N z m z =O 

is considered. If n~s individuals come from the first component 
and s from the second in drawing samples of n and if fh is the 
mean of the sample measured from the mean of the population 
and a is the standard deviation of the sample, 4 then 
( 2 ) 
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1 Annals of Mathematical Statistics , Vol. 2, No. 3, Aug. 1931, 

2 “Random Sampling from Non-Homogeneous Populations”—Af etron, 

Vol. 8, No. 3, p. 6. 



2 DISTRIBUTION OF THE MEANS 

where 7?7„ m £ , c^and a* are estimates of the corresponding 
parameters of (1). 

For the case m = 2 when both individuals come from the first 
component of (1) it is known that the distribution of the means 
divided by the standard deviations of the samples is proportional 
to 

(3) 

da 
l + u z 


the origin of a being taken at - -pJ- . Similarly, when both_in- 
dividuals of the sample come from the second component, r=r is 
distributed as proportional to 


(4) 


du/ 


1 + u/ z 


the origin of w being taken at -ZS. 

When one individual comes from each component (2) be¬ 
comes 

( 5 ) 



because no estimate of the standard deviations of the component* 
of (1) can be made from one individual. The distribution of ¥n, 
is proportional to the first component of (1), and rh z is distributee 
as proportional to the second component. The distributions of 777 
and fn z are independent. 

Expression (5) can be rewritten as 


( 6 ) 



Put 
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( 7 ) 


m z 

m, 


= w. 


If the distribution of v/ is found, the distribution ot z = & may 
be found by making the transformation 
( 8 ) 


2 = 1 - 
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1+v 


or 

( 9 ) 



and 

( 10 ) 
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The distribution of V is a special case of the distribution of an 
index both of whose components follow the normal law. That is, 
we seek the distribution of 


l/= 
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x and y being distributed as 

on ' 


z 0 e 




Cx-xJ Cy-y) 
a, cr z * a/ J 


This distribution may be obtained as follows. 

Lemma I.*»* If two variables % and y,~°o6x.<* o, -eo<y<oo 


* (I<oc. cit.) 

8 Baten, W. D. “Combining Constant Probability Functions M >— 
tan Mathematical Monthly , Oct., 1930. 
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DISTRIBUTION OF THE MEANS 


are so related that the probability of an x in dx and of a y in dy is 
ffx,y)dxdy 

then the probability that v* x-y is in dv is proportional to 


f f(v+y, y) dy 

** — (VO 


dv. 


Consider, first, the portion of (11) in the first quadrant. Put 


v = 


and take the logarithm of each side, thus 


(12) log v = log y- log x. 

Put 

I = log V 
w - log y 
a » log x 


and (12) becomes 

(13) I=w-u 

where the range of m/ and a is - oo to + oo . The equation of the 
correlation surface of w and u is proportional to 

(14) 


FCn/ t w)= e u e 


l 

z FT* 


(clZXc w -y>. 

— 7r+-iy- J 


Hence, F(u,I+a)da when the transformation e u *x is made, 
becomes 


(IS) 


* 

r 2 i-r* 

xe e 



where x ranges from O tow. By the application of Lemma I 
the proportional probability of a value of v in dvvjhznx. andy are 
both positive is obtained by integrating (IS) from O to oo with 
respect to x and making the transformation v' - e 1 . Thus, 
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(16) 
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where 

a =■ 2 rcr, cfg,v + afv 2 ' 


b « (<f, z y - ref, d z 5Z) v+(d*x - r<r, ^ y ) 


is obtained. The distribution of v if both x and y are negative 
(and hence v positive) is the same as (16) except that the last 
term is reversed in sign. Thus, for y positive the distribution of y 
is proportional to two times the first two terms of (16). If y is 
negative, i.e. x negative and y positive or x positive and y negative, 
(16) is obtained in one case and (16) with the sign of the last 
term changed in the other case. That is, the distribution of v is 
proportional to the first two terms of (16) when v ranges from 
-c*> to + oo, 

In our case r-0, m, = x, rn z *y. Hence, the distribution 


of i/ becomes proportional to 
(17) 


°iSkl 




(cg+e, 


v z ) 


Crrty- m 2 f _, z 

+ <r, Z m z v'+d£m i Z(cr*+a?v*) f 0^0*77} 

(d z + a t z v*) 3 ** J 0 

From (8), (9), (10), and (17) z is distributed as proportional 


to 
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(18) 


I rn 

i, a z a z L^' 


Z 2, 

TTh j. rru 


- [z (- rn,-rrtji)-( m, - -mj \ . cr,ff z A #/ * -iu 

+ * A l * 1 

where 


A - (cr l z )+cr*)g, a -+2(o, z -<T£)z+(<?*+ <j z ) 


and 

z. (cr* 777 , - cr^Tr/g) - (<T, z m 2 + cr z 777 ,). 

The origin for z is at the mean of population (1). 

Thus the distribution of -M- for samples of two drawn from 
a population represented by (1) has been completely determined 
as being proportional to 
(19) 

k,(3)+k z (4) + k 3 (l8). 

Let A lt Agpnd A 3 be the respective areas under the curves repre¬ 
sented by the three terms of (19). Then k f , A^.and k 3 are to 
be so determined that 

A, + A z + A 3 =■ N 

where N is the total number of samples considered, and that 

1 k_* _ Zk_ 

A / Az A 3 



where 


k 
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Expression (19) indicates, in general, that if the means of 
the components do not coincide and if one component is not large 
compared with the other, both tails of the distribution of the 
means of samples measured from the mean of the population 
divided by the standard deviations of the samples are heavier for 
populations of the type (1) than for normal populations. In case 
the means of the components coincide one tail will be heavier 
(cf^ 4- <?*). In any case at least one tail will be heavier. 


EXPERIMENTAL RESULTS 


Samples of four were drawn from a population approximate¬ 
ly represented by 
( 1 ) 


f(x> 


646 

sjzn' 


, (X-/S.5) 
~Z 25 


646 

*5]ZTT 


, (X-3Z.5) Z 
4 Z5 


which is the same as Population I in the first reference. These 
samples were drawn by throwing dice. The means of these 
samples were calculated and referred to the mean of (1) as an 
origin. The standard deviations of the samples were obtained and 
calculated. A grouped frequency distribution of 1038 of these 
values is presented in Chart I and in Table I. Large values are 
obtained more frequently than would be expected from a normal 
population. 
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TABLE I 

Grouped Frequency distribution of 1038 Values of rr)/a 
for Samples of Four from Population I 





G. A. BAKER 


9 


CHART I 

Grouped Frequency Distribution of -1038 Values ot m/J 
for Samples of Four from Population I 



ft 
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A STATISTICAL APPROACH TO MATHEMATICAL 
FORMULATION OF DEMAND-SUPPLY- 
PRICE RELATIONSHIPS 

By 

Robert W. Burgess 

A scientific approach to the practical problem of forecast¬ 
ing the prices of commodities clearly requires the develop¬ 
ment of methods of a somewhat mathematical type for 
analyzing the relationships between demand, supply, cost, and 
price. In the case of cotton and other annual crop agricultural 
commodities, the multiple correlation, link-relative and trend- 
ratio methods as applied by Moore, Schultz, B. B. Smith, 
Ezekiel, Holbrook and E. J. Working, and others, have dem¬ 
onstrated their worth. But for copper, lead, rubber, and similar 
commodities not on an annual crop basis, where quantity 
produced, or the quantity available, in a given period cannot 
logically be considered as the supply linked to the average 
price of that period, the method seems inapplicable and an¬ 
other type of approach is necessary. For this reason, and be¬ 
cause of the failure of price to function as expected as a major 
regulator in our present money economy, it seems worth 
while to attempt to develop a general mathematical procedure 
involving cost, demand and supply functions and to analyze 
elasticity of supply and elasticity of demand as a mathemati¬ 
cian naturally does. But some mathematical studies along 
these lines have not seemed to represent a truly scientific ap¬ 
proach to the problem, however helpful they may be in sug¬ 
gesting potentially valuable ideas. 

The mathematical economist of the non-statistical type 
sometimes seems to believe that he has contributed to the 
solution of economic problems if he finds an answer in the 
form of a mathematical equation with undetermined con¬ 
stants. The determination of these constants is left as a sec- 
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ondary step to the statistician. But in certain cases, at least, 
the determination of these constants would be at least as dif¬ 
ficult as the original problem. Moreover, the assumptions 
made m setting up the equations, while not always stated 
explicitly, have not been shown to constitute a sufficiently 
close aproximation to actual non-exceptional conditions for 
the analysis to be useful. It may be suggested that, according 
to correct scientific procedure, careful statistical analysis of 
known and knowable conditions is necessary before worth 
while mathematical formulation can be attempted. 

To bring out my point of view by discussion of a particu¬ 
lar equation, Professor Evans, in his “Introduction to Mathe¬ 
matical Economics/* uses as a cost function, that is to say, a 
function stating total cost for goods produced in a given unit 
of time, in terms of the .quantity produced, u, 

q (u.) = A ll z + J3u + C 

This formula involves several unstated assumptions: 

(a) That the same continuous formula will apply over 
the entire range of cl which may appropriately be consid¬ 
ered in discussing a particular problem. Actually, several 
points of discontinuity would be more normal. In fact the 
natural method of statement of this relation might be mere¬ 
ly a number of discrete points rather than a function de¬ 
fined for all values of a. For instance, under the conditions 
of a particular problem, increase in production might be 
accomplished solely by increasing the number of machines 
engaged in that process. If so, any rate of output other 
than an integral multiple of the normal output of the ma¬ 
chine might represent inefficient operation and therefore 
be barred from further discussion. More generally, I should 
expect a very high unit cost for low values of u, when pro- 
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duction is on a job basis, with abrupt downward steps as 
quantity production methods are applied, and finally a near¬ 
ly horizontal line. 

(b) That a term of the type A u z is worth consider¬ 
ing in the typical case. Some analysis seems called for as 
to when this term is appropriate. In mining and agricul¬ 
ture, some such increase of cost with increasing quantity 
undoubtedly does occur, but it is difficult to imagine prac¬ 
tical cases of factory production in which unit cost in¬ 
creases with quantity at any such rate if advance notice 
of contemplated increase has been given. If production 
beyond present normal plant capacity is desired, such forced 
production might involve some temporary increase in unit cost 
as production increased, but in these days such a case would 
be unusual. 

(c) That the variation of unit cost with quantity is 
important enough to justify singling that out as the single 
factor of variation, although actually my impression is 
that for rather broad ranges of quantity, other factors are 
more significant causes of variation in costs. Such factors 
include regularity of production rate, the weather, labor 
conditions, character of supervision, and management pres¬ 
sure reflecting price conditions for the product. 

C 

(d) That ~ is an adequate expression for the element 
of unit cost which decreases as u increases. Accounting 
discussions of total cost usually emphasize, in addition to 
constant overhead, certain elements of cost which increase 
somewhat with the quantity produced, but not in direct 
proportion. There is also the very sharp reduction in unit 
costs when production is first initiated. As a first hypothe¬ 
sis as a basis for detailed statistical study, I suggest that 
a term of the type Ce * a might be useful, either alone 
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or with the term u , to cover the elements of unit costs 
which decrease as u increases. 

(e) That some definition of cost of a logical nature 
can be framed and applied to this case. 

The statistical aproach to a cost function might well 
show that the range of applicability of a continuous formula 
is rather narrowly limited and that a careful statement as 
to attendant conditions is at least as important for reliability 
of the results as precise determination of the constants. Un¬ 
fortunately, the results of statistical experiments along these 
lines are not available. In these days of stiff competition and 
rigorous government regulation, industrial concerns will be 
reluctant to permit publication of the kind of analysis of 
costs which is essential to what seems to be the correct sci¬ 
entific approach to this problem. It is hoped, however, that 
the preceding discussion has been specific enough to show 
the nature of the analysis which I think should precede the 
formulation of an expression of functional relationship, and 
to suggest the kind of discussion which a mathematical 
economist should include in his results. 

Returning to the general criticism that the treatment of 
supply-demand-price relationships by certain mathematical 
economists is not truly scientific, let us review in broad lines 
the history of scientific progress in those lines where it has 
admittedly been successful. The steps have been about as 
follows: 

1. Creation of a serviceable mechanism for the meas¬ 
urement of data. In the case of astronomy, for instance, 
this required the invention of the telescope and general 
agreement on angular measurement. 

2. Careful making and recording of observations with 
this approved system of measurement, the observations 
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being in certain cases of events over which the scientist 
has no control and in other cases of experiments whose 
conditions could be modified at his convenience. 

3. Derivation of empirical laws from these data. 

4. Discovery of fundamental principles. 

It is true that these steps are not in fact separated as com¬ 
pletely as the outline might suggest, and that attempts at 
the discovery of fundamental principles often help in formu- 
lating-the plan according to which observations are recorded 
and give the workers a motive for intensive effort. I think 
the statement will stand, however, that very few discoveries 
of fundamental principles have been made until substantial 
results have been secured under the (1), (2) and (3) headings. 

For example, Newton’s astronomical laws were discov¬ 
ered after Huyghens and others had made the telescope a 
useful instrument, Tycho Brahe had made an enormous num¬ 
ber of observations, and Kepler had deduced empirical laws 
for those observations. 

Again, in actuarial science, the whole structure of mod¬ 
ern life insurance became possible only after careful vital 
statistics had been recorded for many years and analyzed by 
the empirical laws of Gompertz and Makeham. 

On the other hand, when we turn to treatises on theo¬ 
retical economics or to books and articles on mathematical 
economics, there seems to be no trace of the careful record¬ 
ing of observations or their analysis by empirical laws as 
the basis for their theoretical discussions. I admit, of course, 
that mathematical formulas are stated which look like em¬ 
pirical laws, but no references are given to any studies justi¬ 
fying these particular formulations. If we imagine ourselves 
starting the scientific procedure described above as the basis 
for arriving at real economic laws, we note almost at once 
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that agreement as to the meaning of fundamental terms has 
not yet been secured. For instance, cost of production is one 
idea which is fundamental in analysis of demand-supply-price 
relationships, but cost accountants and economists are by 
no means agreed among themselves as to what the term 
should cover. Under the circumstances, it seems to me that 
the most profitable scientific approach at present would be 
to analyze various relationships which can be put on a quan¬ 
titative basis, with special attention to noting all the special 
circumstances of the cases analyzed. For example: 

(a) In many cases it would probably be possible to 
study the relationship between the price of a manufactured 
article and the price of the raw material or the raw mater¬ 
ials used in making it. A simple case which I have actually 
done in my office and used in price estimating is determin¬ 
ing the price of cotton yarn in terms of that of raw cotton. 
We find it advantageous to compute the cotton yarn price 
according to the formula, compare that with the actual 
price and note the relation to general business conditions 
or to competitive conditions within the industry. The sta¬ 
tistical methods required for a problem of this type are 
obvious, but relatively little, I believe, has been done on 
this line. Broad comparisons of index numbers of prices 
of finished goods and raw materials I regard as another 
kettle of fish altogether. 

(b) Relationship of change in price to change in stocks. 
Any study of actual data of the commodities, such as cop¬ 
per, lead, and rubber, shows that price tends to decline 
when stocks increase and rise when stocks decrease. A 
first step in the quantitative approach to price forecasting 
is to obtain a more precise formulation of this correspond¬ 
ence. It may be noted that some rather vague mathemat- 
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ical ideas come to the surface in discussions of these rela¬ 
tionships. For instance, if in a given month production 
has decreased and consumption has increased, it is some¬ 
times said that these are two arguments for higher prices, 
and prices are expected to rise. But it may happen in cer¬ 
tain cases that even with such a decrease in production 
and increase in consumption the month’s production still 
exceeds consumption and stocks are increasing. On the 
whole, then, the monthly figures point to lower rather than 
higher prices, and it is a useful duty of the mathematician to 
point this out. 

An audience of mathematicians probably regards the 
preceding illustration as trifling. I bring it up to illustrate 
the fact that progress toward a more mathematical atti¬ 
tude in commodity forecasting must proceed step by step. 
A more advanced stage in the quantitative formulation 
is, of course, to determine the equation connecting change 
in price and change in stocks as reported monthly. This 
also has actually proved useful. 

(c) Exact definition of the phrase “cost of production” 
as actually effective when the problem is: 

1. Establishing an appropriate price under regulated 
monopoly conditions. 

2. Determining which manufacturing or mining en¬ 
terprise will survive. 

3. Shutting down established sources of supply. 

4. Creating new sources of supply. 

As I see it, there are at least four costs of production, eacl 
of which is important under certain circumstances. 

1. Complete economic cost, including interest on er 
tire investment at an appropriate rate. 
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2. Economic cost, excluding interest on the capital 
value of ownership. 

3. Out-of-pocket expense, which excludes depreci¬ 
ation charges in excess of actual replacements in the 
period considered, interest on investment, and design or devel¬ 
opment expenses. 

4. Economic cost plus reward for the enterpriser 
over and above interest at a reasonable rate on his in¬ 
vestment. 

Roughly speaking, it may be that these are in order closely 
related to the costs required for the problems stated above. 

(d) Analysis of changes in cost at different price levels. 
One point which has been emphasized by the experience 
of the past year or two is the fact that cftsts are by no 
means kept constant when the price varies. For instance, 
the best information available two years ago suggested 
that at 18^ New York, certain producers of rubber would 
begin to drop out because of costs above that figure. As 
a matter of fact, they seem to have been able to change their 
costs. Such changes are possible by several means, for in¬ 
stance : 

1. Wage differentials varying with market price. 

2. Bonuses for officials varying with profits. 

3. Exploitation of best ores or plantations in times 
bf lower prices. 

4. Increased pressure for efficiency when essential. 

(e) Careful analysis of the relations of cost to quantity 
produced with the consideration of: 

1. The time ahead that the quantity is known to be 
required. 

2. Continuous production versus intermittent produc¬ 
tion, perhaps in lots whose size has been determined to 
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be most economical in view of manufacturing and distri¬ 
buting conditions. 

3. Mass production or production in smaller quanti¬ 
ties by trained mechanics with attention to the discontin¬ 
uity in costs per unit when the transition from one type 
of production to another occurs. 

In view of the considerations suggested in (1), (2), and (3), 
it does not seem to me that the assumption of an algebraic 
formula connecting cost with quantity produced represents 
an adequate realistic formulation of the problem. 

(f) Analysis of way elasticity of supply or demand act¬ 
ually works. Such an analysis seems to me essential before 
a definition of coefficient of elasticity is framed or made the 
basis for elaborate developments. Recent experience has, I 
think, shown: 

1. Elasticity of demand does not mean shrinkage of 
demand with high prices to anything like the extent ex¬ 
pected. 

2. Elasticity of supply functions much more slowly 
than expected at the low-price end of the spectrum. The 
frictional factors include sympathy with employees who 
would lose their jobs if economically unprofitable production 
ceased, reserve funds which permit companies to continue 
operations when even out-of-pocket expenses are not being 
fully recovered, bank willingness to lend on commodity stocks 
which are not, in fact, marketable within a short time, and 
the cost of shutting down and reassembling a working force. 

Most of the matters discussed in (a) to (f) above cannot 
be analyzed fully on the basis of published records. More¬ 
over, in view of the fact that price information is part of the 
life blood of any particular business, it will prove difficult 
for outsiders to secure much of the information which would 
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be needed to complete the analysis. A possible plan would be 
to place research specialists m industrial or marketing con¬ 
cerns to study actual data. It is not probable that the best re¬ 
search of this type can be clone in academic halls as a sideline 
to teaching. It is also unlikely that corporation officers with 
an ax to grind can themselves complete the scientific analysis of 
such material. It seems clear, however, that a satisfactory 
solution requires a coordination of both points of view. 

Summarizing the point of view I have tried to outline, I 
believe that the analysis of cost-price-supply-demand relation¬ 
ships should be relatively more inductive than if has been 
especially in the type of theoretical work classified as mathe¬ 
matical economics. On the other hand, I think that the deduc¬ 
tive approach is worth while, that we should try to formulate 
general principles in this field, and that the ultimate ideal in¬ 
volves a mathematical form,—though, perhaps, when it really 
covers price situations as they are, a mathematical form some¬ 
what different from those required in the physical, chemical 
and astronomical sciences. 





rHE DISTRIBUTIONS OF THE PRECISION 
CONSTANT AND ITS SQUARE IN SAMPLES 
OF 77 FROM A NORMAL POPULATION 

By 

H. M, Feujman 
Washington University 

INTRODUCTION 

The following paper is a study of the properties of the dis¬ 
tributions of the precision constant and its square in samples of 
from a normal population. The properties studied are (1) modes 
and optimum values, (2) the first four moments, (3) skewness 
and flatness, and (4) medians and quartiles. 

The distribution curves shown in the figure are for 77=4, 
10, and 25. All the curves are drawn together and to the same 
scale, so that a graphical comparison of the two distributions can 
be easily made for both the same and different values of 77. The 
numerical values for the various parameters given in the tables are 
for 77=4, 10, 25, and 100, except in the case of the medians and 
quartiles where the values for v = 100 are omitted, and in case of 
77 * 4, no moments higher than the second exist for the precision 
constant, and none higher than the first for the precision constant 
squared distribution. 

I. Distributions 

Let us denote the standard deviation, precision constant, and 
the precision constant squared of the parent population by S, H , 
and U, respectively, and those of a sample from the given popula¬ 
tion by s t / 7 t and u> respectively. The standard deviations, is then 
defined in terms of the variates, pc, , x 2 , . and its mean 

X by the equation 


ii 


( 1 . 0 ) 
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•We also have the following well known relations between S,H, 
and U, or s,h, and u: 


3 = 


'JZh 


(U) 


s z =i— > or u=h z 
Zu 


( 1 . 2 ) 


The distribution of s as given by R. A. Fisher* and others is 

- 775* 

77 n jr- s v ~ z e ZS ds 

f(s)ds -—-—r~ • • • • (1-3) 


77-7 


s 77 - 1 r (djd) 


wnere 77 is the number of items in the sample. 

The distribution of s and the transformations (1.1) and (1.2) 
enable us to find the distributions of the precision constant h, and 
its square u. Thus, using (1.1) and (1.3) we get 


F(h)dh = 


77 ^ H n ’ J h’”e 


_ 77 

TTF 


dh 




.... (1.4) 


and by means of (1.2) and (1.3) we find 

n zU — u —e ~zu da, 

0(a)da - - - — ——p. - 

x ?.n=lr nd) 


. (1.5) 


2. Modal and Optimum Values 
We shall now obtain some of the properties of the distribu¬ 
tions of our parameters. The simplest of the properties of any 

♦See, for example, R. A. Fisher, Applications of “Student’s” distribu¬ 
tion, Metron, VoL 5, No. 3, (Dec. 1, 1925), pp. 90-104. 
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continuous distribution is its modal value or the abscissa of the 
maximum ordinate of the curve. Denoting the modal values of 
the distributions (1.4) and (1.5) by h and a respectively, we 
find from the condition for an extremum 


dF d<fi 

dh ’ da ' 


ii 

* 

.(2.0) 

.. n+1 ~ 

d= ~w u . 

.(2.1) 


In obtaining h and u we regard h and u as variables and H 
and U as constants. We may, however, reverse our point of view. 
That is, we regard H and U as variables and h and u as constants. 
In that case the right hand sides of (1.4) and (1.5) become func¬ 
tions of H and U , and from 


dF 

dH 


*°> aTid 7nr 0 > 


we get 



( 2 . 2 ) 



( 2 . 3 ) 


The quantities H and U R. A. Fisher calls the optimum values. 

3. Moments 
Precision Constant 

In order to distinguish between the moments of the two dis¬ 
tributions treated in this paper, we shall denote the ith moment 
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of the precision constant about the origin by (rr) and that of 
its square by (a )with similar notation for the moments about 
the mean. 

Using the general definition of a moment of a continuous 
distribution, we obtain for the first moment of 77 , which is also its 
mean 


M <hh 



nH* 

dh 


. . . . (3.10) 


To put this into an integrable form we make the transformation 


. nH z 
U Zh* 


(3.11) 


This yields for the first moment 


a. 



(3.12) 


To facilitate calculation we express this in terms of factorials. 
For this purpose we have two cases to consider, namely, the case 
when 77 is even, and that when n is odd. 

When 77 is even is an integer, and, 




rf 12 ^) _ ('77-4X77-6 )• • 

= ^ (r?-3)(n-5)---1 


= HJfi 



£ 

_(3.13) 


When 77 is odd i s an integer and hence 



rjjf 

r’X 


(77-4/77-6) • • • l I? 
(71-3X77-5) - Z U 
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=Hjrf 


(77-3) 




77-3 

T 


77-3 

T~- 


.... (3.14) 


Similarly we obtain for the second, third, and forth moments 
of the distribution of about the origin the following expressions: 


r*(^rr) 


r? 

= rF3 




(3.15) 


ft rW 


m' 3 <»)-h 3 (■■■■ <* 16 > 

r( n z ) 


4/77) z 

*// (z) ~ f„. 3Y-n-5) 


H 4 


( 3 . 17 ) 


Moments about the Mean 

To study such properties of a distribution curve as skewness 
and flatness we must have the moments of the curve about the 
mean. To obtain these we use the well known formulae for ex¬ 
pressing the moments about the mean in terms of the moments 
about any origin. Using these formulae we obtain for the first 
four moments of the precision constant,/?, about the mean the 
following expressions: 

*)•<> 


(h) * 


[ 77 - (n-3)/if (hj\ H z 

(77-3) 


.( 3 . 18 ) 


mJV * 


\Z(n-3)(.- n(2 77 -&)/u'i (rrj\ M 
(n-3Xr7-4) 


( 3 . 19 ) 




H. M. FELDMAN 


25 


y /u*\ \j7 Z (T7-4)+£ 11(77- 6 )(n- 5 )/u.[ 2 -3(r7-3Xn-4)( n- 5 )M 1 
^ (hJ " (77-3)( r?-4)( n~5) 

.... (3.20) 

where /jl[ (i?)is given by (3.12). 

For future use we shall give here approximations for jutj(h) t 
ju z (h)> ju 3 (h)i and (h). The approximations were 
obtained by expanding the various quantities into power series of 

jj. The derivation of these are not difficult but rather long and 

will therefore not be given in this paper. 

These approximations are as follows: 

. (321) 

H ‘ .<3-22) 

/Uj(h)= jfr H 3 .(3.23) 

(where a is a constant) 

H* .... (3.24) 


Precision Constant Squared 

The first moment of the precision constant squared distribu¬ 
tion is, using the general definition of a continuous distribution 
curve about the origin 

n UzL uH r v tt± 1 nU 
mUu). - - - - U~ & e zu da .... (3.30) 

o 77-3 n 77-1 J 

r ST 

We reduce this to a known integral by means of the transforma¬ 
tion 




77 U 
Z u 


. . (3.31) 
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We then obtain for the first four moments of u about the origin 
the following simple expressions: 

J u tM m Th U .< 3 - 32 ) 

u '. (3 - 33> 

^tS&SxZr)* . (3 - W) 

^ (U) * {n-3x”-3)(Ti-7)(n-9) (3,35) 


Moments about the Mem 

For the moments about the mean of the precision constant 
squared distribution we have: 


Mi 

Zn z 

M *-~ (7i-3) z (v-5) 


(3.36) 


'“s*fr?-3> 3 (v-J)( t7-7) U ' ■ ' 

. ±nlfeji till _ u + 

* f77-3) 4 (r,- 3 Hr,- 7)(n- 9) 


(3.37) 

(3.38) 


Skewness and Flatness 

From the above expressions for the mean and also from the 
numerical values given in the tables we may conclude that the pre¬ 
cision constant distribution is less skew than the distribution of 
the precision constant squared, at least for values of rj up to 100. 
But what happens when n grows very large? To answer this we 
make use of the Pearsonian measure of skewness, /8 h defined by 
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Since we are now interested in large values of 77 we make use 
of the approximate values of ju z (h)> ixAju 3 (h ), which are 
M z ( d nd M a (M* ZTz 

Hence we get 


Zn 
a z H 6 


a*t!z . 1 

hm &*(*?)• nm ~r 3 

77* o 0 + rj=o» 


•fra*/*- Hm 4r 


77 = 6C 


and 


Um =0 

71* 


(3.41) 


To find //m fi t (u) we make use of the exact values of M*f u ) 
and Ms ( u > which are given by (3 .36$ and (3.37). This gives 


Urn /&i (u) - U ?77 

77 = 77= ftO 


s H 777 
77= <y» 


r ig^u 3 1 ~ r z-n^u* T 

\fn-3) 3 (v-5)(n-7)\ [7n3F7vT57 J 

jZfv-S) 


2jn*V* -* 5 


(tj-7) z 


= o 


(3.42) 


From (3.41) and (3.42) we learn that both the precision constant 
distribution, and that of its square, approach perfect symmetry 
as the size of the sample, 77 , approaches infinity. 

The flatness or kurtosis of a curve is measured by the 
quantity 


M± 


(3.43) 


From this and the expressions (3.22 and (3.24), (3.33) and 
(3.35) we get 

. (3>44) 


and 

/- * , \ r 4ri 4 '(Zn+2r)U 4 ' ±rj*U_ 

i 7 *l> z U 77=66 (n-3)*(rj-5)(r?-7)(n-9) (v-3)* (n-$) 
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= !im 

77 =00 


{77-5)(Z 71-7-2 7) 
(n-7)(n-9) 


(3.45) 


We may conclude, then, that while the distributions of the pre¬ 
cision constant and its square are both perfectly symmetrical for 
very large values of 77 , they are nevertheless entirely distinct dis¬ 
tribution curves for both small and large values of 77 , since Hm' 
/6 z (h)=3, and Z. As a matter of fact the dis¬ 

tribution of the precision constant approaches the normal curve, 
while the precision constant squared distribution approaches a 
curve of the form 



where y o and a are constants. 

4. Quartiles and Medians 

The quartiles of a continuous distribution f (x,) may be 
defined by the equation 

r&i 

/ f(x)ctx~±, (i = 1,2,3)-(4.00) 

J o * 

For 1, Q* is called the lower quartile; for L - 2, Qi is called 
the median and for i » 3, Q' L is called the upper quartile. 

In order to find the quartiles of the distributions studied in 
this paper we must make use of the incomplete P- function. This 
function is defined as follows 


I(u,p)~ 


r(p+t) 


1 


ufUp 


-v p , 
e v dv 


. . . (4.10) 


Pearson’s “Tables of the Inconplete P- Function” give the values 
oil, a, andp. Thus, if we know any two of the variables we can 
easily find the third. 
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Let us take, now, the distribution of h, 

F( h )dh~ 

z^r(^) 


The various quartiles of this distribution will be given by 


77 



• . . (4.11) 


By making the transformation 

vH k 






(4.11) is reduced to 


1 





(4.13) 


This is, of course, I(ui , and since L and a are known 

we can easily find ll- l from Pearson's Tables. 

Comparing (4.11) and (4.13) and taking into account the 

transformation (4.12) we find that 

± 

S ‘ • H ' 

Since the lower limit in (4.13) is <*> instead of O we see that the 
lower and upper quartiles are reversed. 

To find the quartiles for the distribution of u we simply make 
use of the relation u= h . 

In conclusion we may state that the results of this paper are 
similar to those of Professor Rietz on the distributions of the 
standard deviation and the variance.* 

♦Rietz, H. L. A comparison of the distributions curves of variance and 
of standard deviation, Mathematical Monthly, Vol. 36 (August-Sept., 1929), 
p. 355. 
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TABLE I 

Modal and optimum values for the distribution of the pre¬ 
cision constant and its square from a normal population for 


s amples of 4, 10, 25, and 100. 


Pr. const, h 

Pr. const, squared u 

rzzz 

Mode 

wessmbsm 

Mode 

Optimum 


H 

1.154 A/ 

0.800 U 

1.333 U 


H 

1.054 H 

0.909 U 

1.111 u 


H 

1.021 H 

0.962 U 

1.042 U 

100 

H 

1.005 H 

0.990 U 

1.010 U 


TABLE II 

Values of the mean and the first four moments about the mean 
of the distributions of the precision constant and its square for 


samples of 4, 10, 25, and 100 from a normal population. 


Precision constant h 

77 

Mean ju t 

Mz _ 

Mi 

Ma. 

4 


1.596 H 


1.454 H z 





10 


1.153 H 


0.0982 H z 

0.0678 H* 

0.0815 H 4 

25 


1.054 U 


0.0255 H z 

0.0031 H 3 

0.0026 H 4 

100 


1.013 A/ 



0.000141/V 3 

0.0000907 H 4 

1 Precision constant square U I 

77 

a; 

Mi 

Ms 

A* 

4 









10 


1.429 U 


0.816 U* 


3.110 U 

3 

52.200 U 4 

25 


1.136 U 


0.129 U* 


0.06522 U 3 

0.892 U 4 

100 


1.031 U 


0.224 U z 


0.0199 U 3 

0.00129 U 4 


TABLE III 

Values of/5 ; and^ for the distribution of the precision 
constant and its square for samples of 10, 25, 100, and oo from 
a normal population._ 



h 

__z 

77 

A 

At 

A 

A 

10 

4.8548 

8.4521 


78.3416 

25 

0.7596 

3.9982 


5.3172 

100 

0.00134 

3.2289 

0.3515 

2.5476 

Q0 

0.0000 

3.0000 

0.0000 

2.0000 
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TABLE IV 


Medians and quartiles for the distributions of the precision 
constant and its square for samples of 4, 10, and 25 from a normal 
population. 



Precision constant h 

Precision constant squared u 

n 

Q, 

Median 

Qs 

Q, 

Median 

Qs _ 

4 

0.985 V 

1.298 H 

1.816 /V 

0.970 U 

1.685 U 


10 

0.937 H 

1.094 H 

1.230 H 

0.878 U 

1.197 U 

IBHW 

25 

0.941 H 

1.036 H 

1.146 H 

0.885 U 

1.0 73 u 

1.3131/ i 


Distribution curves of the precision constant and its square 
for samples of 4, 10, and 25, from a normal population. 

Note that the mode for the precisioxi constant distribution is 
the same for all values of v as is seen from (2.0). 

The solid curves are for the precision constant, the dotted 
curves for its square. 

The unit used is the standard deviation of the population 
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A POSTULATE FOR OBSERVATIONS 

By R. Henderson 

When measurements are made by a given observer using a 
particular instrument, if the mean or expected result of the mea¬ 
surements is not supposed to be equal to the true value of the 
quantity measured the difference is considered to be an error, 
personal or instrumental or both. A correction is therefore ap¬ 
plied to any such measurement in order to remove the discrep¬ 
ancy. In other words a given combination of observer and instru¬ 
ment is not considered to give correct or balanced measurements 
until this discrepancy is removed. Also as between two instru¬ 
ments or observers, both giving balanced measurements after the 
application of known corrections, preference is $iven to the one 
which shows the smaller variations between different measurements 
of the same quantity. 

In selecting the formula to be used to determine, from the 
results of a series of measurements involving certain unknown 
quantities, the best measures of those quantities we are in a posi¬ 
tion similar to that of an observer desiring to make a certain 
measurement and selecting the best available instrument for the 
purpose. Such an observer would, in the first place, require that 
the instrument should give balanced measurements and would, in 
the second place, among a number of such instruments select the 
one showing the smallest standard deviation. This suggests the 
following definition and postulate. 

Definition A balanced measure of a quantity is one of which 
the mean or expected value is equal to the true value of the quan¬ 
tity measured. 

Postulate Of two or more balanced measures of a quantity 
the best measure is the one which has the smallest standard 
deviation. 
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Repeated Measurements 

When we have a number of different results 

< 3 -/ , a z . cl?} 

of balanced measurements of the same quantity a , then any 
function of the form 

4 a, * + • • •_+_ f_n 

A * 4 + 


will also be a balanced measure of the quantity. If the standard 
deviations of the individual measures are respectively 

<*2 ■ ' ' ' fn' 

the square of the standard deviation of the function will be 


nZ 2 nZ 2 


CA * < * 


• • 

• 


This is a minimum when ^ = • - • - * so that 

the best measure is the average of the individual measures each 

weighted inversely as the square of its standard deviation. If 
the individual measures have the same standard deviation this 
reduces to the ordinary arithmetical average. 

Combinations of Observations . 

In applying this postulate to the theory of combination of 
observations, suppose that there are rr unknown quantities 

%l(l = 1,2,3 • * • • n and that we have balanced measures of 

77 i fm > r?) linear functions of these unknowns of the form 
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For simplicity we shall assume that the functions have been so 
taken that the standard deviations of these measures are all equal. 
Then a linear function of these measures of the form. 


/j* 7T7 


h=m L = 77 


ZL b hi y h = Z Z 

h=i J n i=l 


b hj a hL 


( 2 ) 


will be a balanced measure of xj if 

hxm 

£, b »j a hi = & f 


(3) 


where 6j =1 if l=J and Sj" = 0 i f i.tj 


h=m 


IS a 


It will be the best measure of that type if 2T b 

• h-1 

minimum subject to those conditions. 

By the method of indeterminate coefficients we find that this 

occurs when we can write 


4 */'&$*"** W 


and the values of the n z coefficients 
are determined from the n z conditions (3). Then 

k* T7 h=m k~"n 

X J’%, %/ a hkyh tjkYk ( 5 ) 


it we write 
77 unknowns xj 


ft* 737 

for L a hk y /7 

r7= / 


The value of each of the 


is thus expressed in terms of the 


func- 


77 
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tions y* or, in other words, may be determined from the n 
equations expressing^ interns of the 77 unknowns xj 
These equations take the form, if we write 
Ajk {or ^ a hj °hk 

A Jk 10 j ( 6 ) 

It will be noted that although no assumption regarding the law 
of error, other than that of balance, has been made the equa¬ 
tions deduced are the same as those derived, in the ordinary 
theory of least squares, from the assumption of the normal ex¬ 
ponential law. 

Measurement of Probabilities. 

Where the quantity measured is a probability and the measure 
is to be determined from the observed result of a finite number of 
trials we know that, if the probability is p the number of 
trials 71 and the number of occurrences of the particular re¬ 
sult r then the expected value of r is n/d Consequently r/rj 
is a balanced measure of p . The measure usuaily associated 
with Bayes’ theorem, namely rui. is not a balanced measure. 
Its mean value is p+ which is not equal to p unless 

p happens to be equal to Yi. 

For this case a different postulate might consistently with 
the general methods of science, have been proposed as follows. 
That hypothesis is to be adopted which makes the compound prob¬ 
ability of the hypothesis and the observed facts a maximum. If 
then we considered one value of the probability as likely as an¬ 
other this would mean selecting the value of p which would 
matfp p r (l-p) T1 ' r a maximum. This would have given 
r/rjas before. 
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Frequency Distributions 

The notation on the subject of moments is so unsettled that 
it appears to be necessary for each writer to specify the notation 
adopted. In this paper the Wth moment about the origin in a 
finite sample will be written m r and the corresponding moment 
about the mean value will be designated by ju r . The moments 
in the population from which the sample is drawn will be written 
rn r and J2 r respectively. 

In this connection an important consideration arises from the 
fact that balanced measures are not always consistent under ordi¬ 
nary mathematical transformations. This happens because if y 
is a balanced measure of x then f (y) is not necessarily a bal¬ 
anced measure of f(x) Let y s Xf/7 and let mean 
values be indicated by prefixing 777 , so that 

fr?j (H) = fft s (y)-x* = O. 

Then since 

f(y) - f(h)+ hf'M-r f "(x)+etc 

we have _ * _ 

rf7 t f f(y)} - ~ f (x)+ c - ^(*0+ ^ fex) + etc 

Ordinarily therefore unless f(x) is a linear function of tc 
or, if not, y is an exact measure of x, rn, £ f(y) } will 
not be equal to f(x) . 

A simple illustration of this fact arises in connection with the de¬ 
termination, from a sample, of a measure for J2 Z . By ordinary 
transformations we have the well known formula ju z szrrt jr rn i 
Also m z and rn, are balanced measure of 777 z and fn t re¬ 
spectively but rn* is not a balanced measure of m* . We 
have in fact 77 i,( rnf) = rn t z + ^ Ju z . Therefore, 

Mz = 0~h)M z - 

The balanced measure of J2 Z would therefore be Thl yu z 
which is not formally consistent with the balanced measures of 
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TV, and rrj z in the light of the equation ju 2 - m z - mf. By a 
similar line of reasoning as shown by Thiele, we obtain 

77 Z 

(rt-/) { 77 -%) /U ' 3 as a balanced measure of ju 3 and by 
Tschuprow’s modification of Thiele’s analysis 

(n-!)(n-Z)(77-3) { (r>Z - Zri - h3) - 3 (Zu- 3 )jm a } 

as a balanced measure of ju z . Here however we are faced 
with the further difficulty that while this is a balanced measure 
its standard deviation for small values of r? is so great that 
possible values of and would result in negative values 
of JCl 4 whereas in any real frequency distribution not only, must 
M+ he positive but (Ju z ju^ -Ju 3 ) which is the mean 

x z ) z (x z -x s jp(x 3 -x,) z 

value of -r- must also be positive. 


If, therefore, we wish to derive a value of certainly satis¬ 

fying this condition we must determine the average value of 

0cj-x^) Z (x z -x 3 ) Z (* 3 -*,)* 


e 


for all combinations of three 


values from the sample of 77 and use it as a balanced measure of 
(/ 2 z JUjl- ju 3 z ) . This average value is found to be 


77* 


The analysis is as follows. 

— C(Xr*2 )Z(X Z X -3 )Z(X 3- X l)*' ] 

l-6- I 


c *. z 0 3 Z + i 

= tTTjlx, X 3 - x z X 3 

__ _ —. _ _ __ 2 Z _ 3 

- 77?^ 7TJ ^ + Z 777 S rr? z m, - 777^ 7rr f - TH 3 - 777g 


- X. 2 L XL 


i-/£- a? 
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If repetitions were allowed in the finite sample the average 
value would be the corresponding expression in moments of the 
sample, namely, . But since the ex¬ 

pression vanishes if two or more of the values of X involved are 
equal the exclusion of repetitions reduces the total number of 
permutations three at a time from 77 s to t ?- z ) 

without reducing the sum of the values. The average is there- 
increased to 


(r?~l)(n-Z) 
The second moment /2 Z 
as the mean value of (jUl Z. 
the mean value of 


^ Mz M4 ^ 

might have been similarly derived 
—and the third moment ^u 3 as 


Czx, -x z -x 3 )(Zx z -x 3 -x t )(Zx 3 -x,-x z ) 


6 

In this latter case the expressions averaged do not vanish when 
two only of the value of % are equal but they cancel one an¬ 
other in pairs so that their sum vanishes. 

We have thus as working approximations 




77 

71-1 


_ T7 Z _ 

^3 * (77-I)( n-Z) 

The net result of this investigation of the application of bal¬ 
anced measures as presumptive values of moments in frequency 
distributions seems to be that, in view of the formal incon¬ 
sistencies involved, it is necessary to carefully select the func¬ 
tions to which such measures are applied. The functions con¬ 
sidered above are suggested as well adopted for this purpose and 


^3 
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as probably sumcient for all practical purposes. If they are 
adopted as fundamental the resulting approximations for the 

Pearson constants /&, - M* /Mz ant * M*/ 

are 


/S, --M Jn = U&dL u 2 /u 3 - H&zU- /S. and 

A*3 r A*z - (u.gjZ A+3 /A*z - (n-z)l 

a . a -j » z^EL~£jL = fz 7 ' 0 * 6L& *£± 

’ll 'v /j.* n(n-Z) Mg 

,Oz!>L 

77f77-2) 

It will be noted that the coefficient in the latter equation is 
very nearly unity for even moderate values of 77 . 




A SHORT METHOD FOR SOLVING FOR A CO- 
EFFICIENT OF MULTIPLE CORRELATION 


By 

Paul Horst 

The method which we present presupposes a familiarity with 
the Doolittle method 1 for solving normal equations. We start 
with the determinant 





r !Z 

77 

(1) 

p= 

r tz 

1 - 

r zn 



In 

In 



where the elements are zero order coefficients of correlation. 
Now the adjoint determinant of (1) may be written 



K 

to 

1 

1 


(2) 

V,z 

1 

1 



&in 

1 

1 

0? 

^7777 


where the elements are the cofactors of the elements in (1). 

From the elementary theory of determinants * we know that 

( 3 ) r *#”' 1 


The adjoint determinant of r may be designated by KR 
where 

( 4 ) KR = r ”'■* 


1 Mills, F. C M Statistical Methods, p. 577. 

2 Bocher, Maxime, Introduction to Higher Algebra, p. 33 . 
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From (3) and (4) we have 




Cn-lf 


or 

( 5 ) 


K - 


j-, r?(rj-2) 


Hence the adjoint or r is obtained by multiplying each ele- 


ment of 

B by 

And if we write 





- 

*,r, 

(6) 

KR = 

A tz 


A zv 



*,n 

1 ' 

1 

1 

1 R 
* 

A nn 

then 





(7) 



it 


so that (1) may be rewritten 





An 

'W Ff=z ' 

t 

1 

1 

1 

1 

dlM-, 

if™ 

(8) 

J? = 

Atz 

jpn-Z 

Aju 




Am 

Agn 

Arj77 







The numerators of the elements in (8) are the cofactors of 
the elements in (2). 

Let us now consider (8) as the coefficients of a set of normal 
equations whose constant terms are zero, and let us follow through 
literally the Doolittle elimination process. 

For simplicity we outline the reduction of a 4-variable problem 
as follows. 
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The oC -equations are the original equations with the co¬ 
efficients to the left of the diagonal omitted. The -equations are 
the product equations which are subtracted from the ©C -equations. 
The ^-equations are the reduced equations which may be repre¬ 
sented symbolically by 


(9) y-o^-z/5 


The S-equations are the ^equations divided by the negatives 
of their respective leading coefficients. That (9) is true may be 
readily proved from the theorem 3 


( 10 ) 


A if 

AU 

A kj 

A /rt 


= AAijkt 


Where the notation indicates cofactors rather than minors. The 
proof is made more obvious if (9) is written 

f m { fa~A)-A^-Aaletc, 

for each successive subtraction reduces the determinant by one. 

If we indicate the leading coefficients of the ^-equations by 
Yu we may prove that 


(11) /P= ffrn 

We have in the case of four variables 

. du C^uzz. rdixzzM Cda sz as. - cL 

(12) 7 K* K*A,, X*A"ut 

A = r 

or from 8 

=* 

In the general case we have 

T>-1 T^Crt-l)* 

V7 , J2 _ - ** _ = Jp 

TV*Li - g>f7T-2>7? “ 77(77-2) ^ 


8 Bocher, Maxime, Introduction to Higher Algebra, p. 33. 
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Now let us consider Kelley’s equation for the coefficient of 
multiple correlation 4 with a slight change in notation to be con¬ 
sistent with the above, 

< 13 > 

Obviously 

(14) 

So that (13) becomes simply 


^77 /*—~ 77 

But from (9) we have 

77 

(16) 

hence ^n-fZ--(-n-i) 

But = 1 therefore we get 

(17) %77-iz--(r?-i) * 

In other words s simply the square root of the last pro¬ 

duct summation. 

From (17) it is obvious that the solution for the coefficient 
of multiple correlation is considerably shorter than the standard 
Doolittle solution for regression coefficients. All of the back solu¬ 
tion work is eliminated, as is also the calculation of the last re¬ 
ciprocal. 

The only caution needed with respect to the order of the 
variables is that the dependent variable shall be the njth variable. 

The usual summation check method may be employed exactly 
as in the solution for regression coefficients. 


Kelley, T. L., Statistical Method, p. 301, eq. 275. 



A SHORT METHOD AND TABLES FOR THE 
CALCULATION OF THE AVERAGE AND 
STANDARD DEVIATION OF LOG- 
ARITHMIC DISTRIBUTIONS* 


By 

Thomas N. Jenkins 
New York University 

In fitting various types of curves to reaction-time data, 1 the 
writer was impressed with the enormous amount of labor and 
boredom involved in the calculation of the constants of logarithmic 
distributions. Besides the constant use of a set of logarithm tables, 
it requires the tedium of squaring large numbers on a machine to 
compute the second moments of the distributions. In order to 
eliminate some of the labor involved in such a process, a short 
method was devised for the computation of the average and the 
standard deviation of logarithmic distributions. 

The short method described in this paper was originally 
developed to facilitate the work of fitting logarithmic normal curves 
to a large number of reaction-time distributions, but dispersions 
approximating this type seem to be sufficiently common in econ¬ 
omics and biology to warrant a more general fise of short methods 
in the computation of the constants of such distributions. In the 
field of economics, logarithmic curves have been fitted with suc¬ 
cess to distributions of income and prices, and probably could be 
applied equally well to distributions of capital. Many skewed 
distributions can also be found in the fields of biology and psy¬ 
chology. Kapteyn fitted a logarithmic curve to a distribution > of 
the minimum weights necessary to produce a sensation of pres- 

*A portion of the work involved in this paper was carried out during 
the writer's tenure as a National Research Fellow. 

1 Cf. Facilitation and Inhibition. Arch. Psychol. No. 86, 56 p. 



46 


T. N. JENKINS 


sure. 2 Kapteyn attempts to show that logically the normal curve 
is the exception and skew curves the rule. For example, if “the 
diameters of certain ripe berries” are distributed in a normal curve, 
their volumes will be distributed in an asymmetrical curve; in 
other words, volume increase will be dependent upon size, so that 
volume changes are greater for large berries than for small ones. 
That skew curves are the rule can be shown analytically. Suppose 
certain quantities z. are distributed normally, and any other quanti¬ 
ties x are expressed as functions of 2 , thus, 


Then, 

(1) d*. = f'(x)dx 


If the frequency curve for the z's is, 



v*= — — 

aJZrT 


e 


then the frequency curve for theft’s is, 

( 3 ) Iffr)- r-i) 2 

f'Me 


/v 


za* 




It will be seen at once that the x’s cannot be distributed normally 
providedx is a non-linear function of m. If we let z = logx, then 
gLu * and equation (2) becomes 


(4) 


y - 


N 


'/ZTT 


I e 

X 


Jhgx-MY 
za*> 


This is the logarithmic curve of distribution, the theory of which 
has been treated by several writers, one of the first and most im¬ 
portant papers on this subject being that of McAllister.® The study 

2 J. C. Kapteyn, “Skew frequency curves in Biology and Statistics”. 
Groningen, p. 42-43, 1903. 

3 The Law of die Geometric Mean. Proc. Roy . Soc. 29:367. (1879). 
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of the properties of the logarithmic curve of error was undertaken 
by McAllister at the suggestion of Galton, 4 who saw the possibility 
of applying it to psychological and social phenomena. 

Dispersions approximating this type are illustrated by dis¬ 
tributions which are definitely limited at the zero point, but a more 
definite presumption in favor of the logarithmic curve is indicated 
when the real origin, determined a priori or deduced from empirical 
considerations, does not correspond with "he origin on the value 
scale. 

Reaction-time distributions are good examples of dispersions 
where a displacement of the origin is indicated by empirical con¬ 
siderations. A little reflection will show that there must be a 
physiological limit for the speed of reaction. It takes a certain 
minimum time for the neuro-muscular machine to do its work. 1 The 
time it takes for the machine to do its work constitutes an un¬ 
disturbed region within which no deviations ever occur. Reaction¬ 
time dispersions approximate the logarithmic more closely than the 
normal curve of error. Investigations in the field of learning often 
give distributions which have origins other than the zero of the 
scale which can be determined a priori . . . .that is. the real origin 
follows inevitably from the conditions of the experiment. If the 
norm of mastery for learning a maze is two perfect trials out of 
three, then the criterion is such that an animal to learn a maze must 
make at least two perfect runs. In other words, the experiment¬ 
er's criterion is such that no deviations could possibly occur under 
two trials. 

In using the short method for finding the first and second 
moments of a logarithmic distribution, the computer must still re¬ 
sort to a table, but in this case it is only necessary to use a single 
page table instead of an extensive logarithm table. Furthermore, 
the labor of squaring the logarithms is eliminated. The short 
method can best be explained by following the process through an 

4 The Geometric Mean in Vital and Social Statistics. Proc . Roy . Soc 
29:365. (1879). 
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actual example. 5 This is illustrated in Table II on a distribution of 
reaction-times. Beginning at 70 (the real origin of the distribu¬ 
tion is assumed to be at 70) the step-intervals are numbered from 
zero to the end of the distribution. Under the log x column of 
Table I the value for each step is found and multiplied by the 
frequency for each step. This operation gives the values shown 
in the Flog x column of Table II. The sum of these values 
divided by /V (number of cases) gives the correction C. The 
average (log 6 f ) for the logarithmic distribution is finally found 
by adding a factor K to the correction C. The constant K depends 
upon the length of the step-interval. In this distribution, the 
length of the step-interval is ten. Looking under column K of 
Table I, we find that the value of K for a step-interval of ten 
units is equal to .69897. The geometric mean (G) of the dis¬ 
tribution is found by adding 70 to (GO * 

The process of finding the second moment and standard 
deviation (& 9 ) is similar to that for finding the first moment 
and the average. In one respect it is simpler: no correction has 
to be added for the length of step. The F log 2 * column is ob¬ 
tained by multiplying the value for each step in the log 2 x column., 
of Table I by its appropriate frequency. The sum of these divided 
by N (number of cases) gives the crude unit moment. The square 
of the correction C is then subtracted to give the corrected unit 
moment around the average. The square root of the corrected 
unit moment around the average gives the standard deviation^), 
and the antilog of Og gives the standard deviation ratio (cr r ). 

The formula for finding the average of a logarithmic distribu¬ 
tion is, 

log =- -j^- + K*C+K 

8 For those interested, the proof of the formulae for getting the average 
and standard deviation is given in an appendix at the end of this paper. 

8 One would expect the geometric mean to be different if the origin were 
taken at a point other than 70. An origin at 70 was assumed because it 
results in an extremely good fit to the distribution. In this case, 70 would 
correspond to the physiological limit below which no deviations ever occur. 
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TABLE II 


Time 

F 

Step 

F log K 

F log 2 * 

70 

1 

1 



80 

3 

2 

1.431363 

.682934 

90 

14 

3 

9.785580 

6.839826 

100 

40 

4 

33.803921 

28.567627 

10 

55 

5 

52.483338 

50.081832 

20 

60 

6 

62.483561 

65.069923 

30 

52 

7 

57.925054 

64.525229 

40 

46 

8 

54.100197 

63.626769 

50 

33 

9 

40.604814 

49.962150 

60 

28 

10 

35.805100 

45.785901 

70 

21 

11 

27.766605 

36.713541 

80 

14 

12 

19.064189 

25.960237 

90 

9 

13 

12.581460 

17.588126 

200 

7 

14 

10.019546 

14.341615 

10 

7 

15 

10.236785 

14.970255 

20 

4 

16 

5.965446 

8.896638 

30 

3 

17 

4.555541 

6.917653 

40 

1 

18 

1.544068 

2.384146 

50 





60 

1 

20 

1.591064 

2.531486 

70 





\80 

1 

22 

1.633468 

2.668219 


400 


400)443.381100 

400)508.114107 





1.270285 




C= 1.108452 

C = 1.228667 




K= .698970 






c^= .041618 



log 

G,= 1.807422 






C g = .204004 

For origin at 70 ... . 

G,* 65.8 





G - 135.8 

cr r = 1.59 




so 


T. iV. JBNK1VS 




table I 




Step 

Log x 

(Log xf 

Step 

Interval 

K 



1 

.00000 00000 

.ooooo ooooo 

1 

30102 99956 

2 

.47712 12547 

.22764 46917 

2 

.00000 ooooo 

3 

.69897 00043 

.48855 90669 

3 

.17609 12590 

4 

.84509 80400 

.71419 06972 

4 

.30102 99956 

5 

.95424 25094 

.91057 87668 

5 

.39794 00086 

6 

1.04139 26851 

1.08449 87247 

6 

.47712 12547 

7 

1.11394 33523 

1.24086 97921 

7 

.54406 80443 

8 

1.17609 12590 

1.38319 06496 

8 

.60205 99913 

9 

1.23044 89213 

1.51400 45481 

9 

.64321 25137 

10 

1.27875 36009 

1.63521 07719 

10 

.69897 00043 

11 

1.32221 92947 

1.74826 38633 

11 

.74036 26894 

12 

1.36172 78360 

1.85430 26993 

12 

.77815 12503 

13 

1.39794 00086 

1.95423 62678 

13 

.81291 33566 

14 

1.43136 37641 

2.04880 22253 

14 

.84509 80400 

15 

1.46239 79978 

2.13860 79042 

15 

.87506 12633 

16 

1.49136 16938 

2.22415 97018 

16 

.90308 99869 

17 

1.51851 39398 

2.30588 45856 

17 

.92941 89257 

18 

1.54406 80443 

2.38414 61255 

18 

.95424 25094 

19 

1.56820 17240 

2.45925 66473 

19 

.97772 36052 

20 

1.59106 46070 

2,53148 65837 

20 

1.00000 OOOOO 

21 

1.61278 38567 

2.60107 17684 

21 

1.02118 92990 

22 

1.63346 84555 

2.66821 91953 

22 

1.04139 26851 

23 

1.65321 25137 

2.73311 16157 

23 

1.06069 78403 

24 

1.67209 78579 

2.79591 12465 

24 

1.07918 12460 

25 

1.69019 60800 

2.85676 27889 

25 

1.09691 00130 

26 

1.70757 01760 

2.91579 59062 

26 

1.11394 33523 

27 

1.72427 58696 

2.97312 72744; 

27 

1.13033 37684 

28 

1.74036 26894 

3.02886 22909 

28 

1.14612 80356 

29 

1.75587 48556 

3.08309 65087 

29 

1.16136 80022 

30 

1.77085 20116 

3.13591 68471 

30 

1.17609 12590 

31 

1.78532 98350 

3.18740 26197 

31 

1.19033 16981 

32 

1.79934 05494 

3.23762 64129 

32 

1.20411 99826 

33 

1.81291 33566 

3.28665 48386 

33 

1.21748 39442 

34 

1.82607 48027 

3.33454 91850 

34 

1.23044 89213 

35 

1.83884 90907 

3.38136 59785 

35 

1.24303 80486 

36 

1.85125 83487 

3.42715 74737 

36 

1.25527 25051 
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37. 

1.86332 28601 

3.47197 20810 

37 

1.26717 17284 

38 

1.87506 12633 

3.51585 47414 

38 

1.27875 36009 

39 

1.88649 07251 

3.55884 72561 

39 

1,29003 46113 

40 

1.89762 70912 

3.60098 85775 

40 

1.30102 99956 

41 

1.90848 50188 

3.64231 50672 

41 

1.31175 38610 

42 

1.91907 80923 

3.68286 07246 

42 

1.32221 92947 

43 

1.92941 89257 

3.72265 73909 

43 

1.33243 84599 

44 

1.93951 92526 

3.76173 49312 

44 

1.34242 26808 

45 

1.94939 00066 

3.80012 13980 

45 

1.35218 25181 

46 

1.95904 13923 

3.83784 31768 

46 

1.36172 78360 

47 

1.96848 29485 

3.87492 51187 

47 

1.37106 78622 

48 

1.97772 36052 

3.91139 06589 

48 

1.38021 12417 

49 

1.98677 17342 

3.94726 19240 

49 

1.38916 60843 

50 

1.99563 51945 

3.98255 98299 

50 

1.39794 00086 


and, 6)- antilog (C + K) 

where G f is the geometric mean measured from an origin which 
may be other than the zero of the scale. The geometric mean ( G) 
measured from the zero of the value scale is, 

G - G, + (displacement of the origin) 

The formula for finding the standard deviation around the average 
is, 

9~J - N - C 


and, a r = antilog <j 9 


Summary of steps in the calculation of the average (logG,) 
by the short method: 

1. Banning at the origin, find the deviation of the mid¬ 
point of each step-interval from the origin in units of step- 
interval. 

2. Using Table I, find the log % of each step-deviation and 
weight it by its appropriate F (frequency). 
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3. Find the sum of the Flog Vs, and divide this sum by N 
(number of cases). This gives the correction C. 

4. Using Table I, find the value of K corresponding to the 
number of units in the step-interval. Add the factor K 
to the correction C to get the average (log G t ). 

Summary of steps in the calculation of the standard deviation 
(c^) around the averager log 6,) by the short method: 

1. Using Table 1, find the log 2 ?: of each step-deviation 
and weight it by its appropriate frequency. 

2. Find the sum of the F logV's, and divide this sum by 
N. 

3. Then subtract the square of the correction C to get the 
second unit moment around the average. 

4. Extract the square root of the second unit moment to ob¬ 
tain the standard deviation (cJ 9 ). 

APPENDIX 

Deduction of the Formulae for the Short Method 

Let log G be the logarithmic mean, x, the length of step, 

f t . f n the frequencies for successive steps, and ni l ,7n z 

. m v the mid-points of the steps for origin at zero. Then 

the first mid-point, m,, is at x/Z the second, rri z , is at 3x/2e. tc. 
For convenience, these items may be arranged in the form of a 
table._ 



H 

F log M 

F log Z M 

777, 

D 

f, !°g % 

f, !og z X ^Z 

m z 

1 

f z log s */z 

fgtog 


K 


f r> ,0 9* ^ * 


N logo - aj,**^* . at. 










L0GARTHM1C DISTRIBUTIONS 


51 


where G is the geometric mean, and <? 9 is the standard deviation 
around log (S'. 

We have, therefore, 

. 

Stating the logarithm of each fraction as the sum or difference 
of the logarithms of its factors, we have, 

, _ f, (logl-logZ+log z)+~+ f„ \log(Z-n-l)-log2 + log z] 

tog G - 77 - 


= t l££l tl!Sll±ZZl t friMQrll +log * - fog Z 


= L [f„ log(2r,-l i+ , x-tog2 
N 


Since 


, 27 f log x 

t°9 x * N 


and, 


. „ ZfloqZ 

!ogZ= /V^ 


Letting K m /og x - log Z 


and. 


- 27 f/„ fog(2n-t% 
c = Jj 


we finally have, 


log G * C + K 




34 


T. N. JENKINS 


Where C is the correction, and K is the constant indicated in 
Table I. 

For the second unit moment around log G, we have, 

^2 £ [fr, log 2 x\ (Z \f„ logC ^ zJ ) x\ \ 2 

9 = N ~\ N j 

- M. tiL (fjill. f °y Zx ^\ 

A f 

(z f v log(M^ 1 ) \ t 2Z \f?? % * f / i t 

\ N ) N '~ V 

Expanding again and collecting terms, we have. 


{f„[logrZn-lj\ ] 

f Z[f„ logfZ-n-l)} l 2 

N 

N 

K / 


. z\fy^Jog(Zn-l \I *| - 

N ~ ° 


In Table I, x= (2 n - 1) so that, 

log * = log (2 n- 1 ) 
and (log x) z = [ log (2 n- lj \j 2 


For each step, 1, 2, 3 .... v, the corresponding values of tc are 
1, 3, 5 .... 2 77- 1. Note that x as used in the table is not the 
same as x as used in the deduction of the formulae. 

The figures in Table I are accurate to ten places of decimals. 
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The log:* column consists simply of the logarithms of odd numbers 
from one to one hundred. K was computed by subtracting log 2 
from the logarithm of each number indicated in the “step interval” 
column. The (!oq ^column was computed by squaring fifteen 
place logarithms with the aid of calculating machines. This had to 
be done by indirect methods through the use of the simple algebraic 
relationship, (a + r) z = a z + Z&r + r z where < 5 . is the first part 
of the number and is the remainder. The table was computed by 
two different persons and checked on two different calculating ma¬ 
chines by each person. 




ON SYMMETRIC FUNCTIONS OF MORE THAN 
ONE VARIABLE AND OF FREQUENCY 
FUNCTIONS 

By 

A. L. O'Toole 
N ational Research Fellow 

In a paper published in this journal 1 the writer has developed 
a simple differential operator method for expressing any sym¬ 
metric function of thevariates x, , , • • • s^as a rational, 

integral, algebraic function of the power sums s ,, s z , * • ,s^where 

u/is the weight of the symmetric function and 

v k k k k 

S^Zxi ~x f 

The transformation to moments is then simply a matter of 
recognizing that i?u' k = s k if u k is the k th moment of the ^vari¬ 
ates with respect to the origin from which they are measured. If 
the origin is at the arithmetic mean of the 77 variates the prime may 
be dropped and then -rju^ , 

In the above mentioned paper the variates x t are of the serial 
distribution type, but, of course, not necessarily integers. The 
extensions to the case of more than one set of variates and to 
frequency functions now suggest themselves. It is the purpose of 
this note to discuss these problems simultaneously. 

Suppose that two sets, of pgvariates each, , x 2 , • • • ; x n 

and^ ; -y w are given and that ( L « 1 , 2 , .... ,77) are cor¬ 

responding pairs. Modifying the partition notation used in the 
previous paper the symmetric function to be considered may be 
written in the form (a^'a^ay* . b™* b z *)i.e. the 
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where 

a /' a 2' a 3> * "* fy, • • 77 „ 7 ^, 77 ^, • • • 77 ^, 777 ^, 77 £, * . . 

are positive integers and where 

a, > a z > a 3 > • - * > O. 


77/ A T7 Z + 77$ + . • • = 777, -f 777^ + 777 3 + • • * • 

e g- (3*ZA5l)=toZ.4?l)*Zx]*jxtfy* yj , t Uj4k. Obvious- 
ly the order of the parts in the bipartite notation will be important, 
corresponding parts being equidistant from the left of their respec¬ 
tive sections of the partition. The double partition will be said to 
be of weight a, n f + a z rt 2 a a s 77 #+ . ... in a: and of weight 
w z = 6 , 777 , + b z rn z + b 3 m 3 -a • • * iny. There will be no loss of 
generality if it is assumed that us, £ . 

By a procedure similar to that employed in the first chapter 
of the paper already referred to, it may be shown that any sym¬ 
metric function of the type defined above can be expressed as 
rational, integral, algebraic function of the symmetric functions 

s u > s iz * s zi> .- 

where 

^hk X C 9 

Moreover, the terms of this function will be isobaric of weight wj 
inX, isobaric of weight u^iny, and hence isobaric of weight 
w, + w z in x and y together, 
e. g. Multiply s Zf by itself: 


$t * (xfy, + X *Y Z + 


”21 


+ X. 


Z y y 

77 y 77 J 


» (x*y**■ - *x*y£) +*(*?*&,yp*?*\y, 3&~) 


" s ^ 


+ 2 Cz z i 2 ). 


(Z 


* l 2 )* (l 


JU 


- s * 2 


)/z. 
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each term being of weight 4 in 2 in y, and 6 in x and y together. 
It is possible then to write 

s r7, 77- 77- i 77?, / 7T7 2 T7> 3 * 

fa, a 3 A 4 b 3 ‘ ‘ ^ = 


» * ’ 3 CJ > / 3 Wf w z ) r ^ 

where f stands for a rational, integral, algebraic function of the 
sums (or product moments) 3/y , /«* 1, • • % , 7 -^, • • •, v / z , 

isobaric as explained above. Suppose that a new pair of variates 
x n +i s °c, y n +j */6 are introduced. Obviously siy becomes S;y 
+<*jy8 J . Hence applying Taylor's Theorem f becomes 

f* (*0d„ +</S\t„ +■■■ * • - • - +*"’/3 


+U/3<± II +«/6*d /;l + - 


■ * ■ 




+ 


+ +*t,fi Z d l2 + ■ ■ ■ + 


W >A ■)**& t> I 

■+«'/& -Wfe' 

JL 

all other terms being identically zero, where d;y * 4^y an< i 

= /*/, " • ■ ,Wi , J*l, * • •, vv z . 

Using the multinomial theorem and collecting coefficients oi<*,‘/3 J 
the above expression may be written in the form 

(l + 't'/S D" + ■*,f3 z D /z + ■ +cJ’jS J D £ : * —)f 


where 


( 1 ) 


ID,, ~d t/ . 


'-'-'a " 

r A* -< 
L Du * 


■'/s 


//» 

'tz> 


*/3 


Vzz*<i*z + <*7t/Z> 


LX* 

r 


* 3 / > 


'tA « 


&23 * ^23 + °£ u d tZ , 
&3Z = ^3Z + <*// <*2 f 
&4t s 
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%s- 

: &/S > 





+ 

cC ff 

d, 3 


~^33 

+• 


d zz 



4~ 


*3, 

A, 






d/y d L Z j d t *; • • • • 

In. .y ILL diE -ill _ /--i-.-vv 1=1 ... iv 

L *J /c t ! k j /(J .. . ’ J > ,J ‘ - *■' 

where! k, i, + k z i z + k 3 1 3 + .= i, 

%J, +k Jz +k 3 J 3 + . m J> 

4 , u • * 3 . • • , Ji’-lz -4* ' ■ *» k >> k 2‘ k 3 ‘~ being positive in- 

tegers. The inverse relation is given bv 

' k.t V *,/•••• 

where L = /,••--, w, , j= /, ■ • • , , 

k , l ,+ k z i z + k 3 ‘ 3 + -‘-*‘, 

k, // * k ziz + k 3 J3+ - ■ = h 

k,+k z + k s + • ■ • • • = 

4, 4. 4, • - V J, J z . j 3 . • •, 4* <4, ••• being positive in- 

tegers. 

The effect of the new variates c^don 

ra/ 7 ' <2/* ^ . . . . b,”" b”* k 3 m * ■■■) 

is to replace this symmetric function by 

(a?' "• ap b”> b™* b™'-.) 

+ 2 V •' • b™’’ 1 b™ z b™*. • • ) 

• • 4"*4 m *' 1 b 3 m3 ••■>•-. 

Hence replacing /" by (a^'a^a^ ■ ■ ■ b, ^'b™* b/” 3 then 

(1 * <*/3D„ + ct,/3*D )St + z /3l> z , * ■ 

)(a.r'*? z °r 3 • • - b ™‘ b ™* b ™ j ■ • • •> 
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= fara ” 1 a, ** • • • b'bpb™ 3 - ■■■) 

+« a ‘ / ek q ” 3 ... b™'~'b £ bj 77 - 3 ■ ■ ■ ) 


4 **- ' •> ' 

Equating coefficients of like terms in «< and/Sit is seen that 


A n,b,( Q ?' a t £a ” 3 ■ 


■) 


4" 7, ' / 4’ w *4” 7jr - - 

■X 

Da z b 2 {&?* &** 

.. . 4 "'4^4 

™3. . . 

*(ar <2?‘''a7* ■ ■ • 


■ ), 


etc. and 

D hk (a?'a?*a? • • - 4~'4 ■ - J-O if both 

h and/rare not among a 3t * • * and b,, b 2 ,b 3t - • * 

Respective ly. Hence also 

j%, • tnp&*o 

if t Sj - ; are not all respectively less than or equal to 

77 /, 77 2/ • - v 77^ and also respectively less than or equal to m„ 

77£, 7 n Jt • * ;7?^svhere b t , t z> t 3 , - * - *, are positive integers or 
zeros; that is, if even one of the values of t is greater than the cor¬ 
responding 77 or m then the effect of the operation is to give zero. 
This last property of the D operators is important not only in 
minimizing the work of computation as will be seen in the illustra¬ 
tions given below, but will be of fundamental importance in the 
theorem to be stated in the closing paragraph of this paper. It 
should be noted here that the multiplication of the operators is com¬ 
mutative. 

To illustrate the application of the operators consider: 
a) C£ll 2 )=c,s zf s t/ + cr £ s sz where c, and c 2 are constants to be 
determined. These are the only terms which satisfy the weight 
conditions. Operating on the left with P z/ and on the right with 
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4/ s ives C 1 - 1 ) * C, s n , i.e. S" : C,s /M hence c, = 1 . 

Operating on the left with D 3Z and on the right with d 3 Z +d n d zl 
gives O* c,+ C z , hence C z =-1 and thus (Zl.l z )=s z , s„ - s 3£ . 

^ C ! S 63 * C 2 S J5Z S u +C 3 S 3! S 1Z v C 4 S 4Z S 2! + C J S A/ S SZ 

+ C 6 S 4I S H + C 7 S 32 S 3I + C S S 3! S St S U * C 9 S 2l • 

These are the only terms which satisfy the weight conditions. If 
the expressions for all the symmetric functions of lower weights 
are known then to determine the constants it is sufficient to operate 
on the left with 

D n > Qn -5/ * Q/* Q 3 and on the right with theii 
respective equivalents in terms of cL However if the expressions 
for the symmetric functions of lower weights are not known then 
it is perhaps simpler to operate with 

^12 > 1 ^ 2J > ^31 ^21 ^ 7 / > ^ 3f ^32 » ^ 4 / ^22 ' ^42 ^ 2 / * 

<d n ct S 2 , d 63 on the right and with their respective equivalents in 
terms of D on the left, <d /z and D fZ give 0= c 3 s Sj and hence 

c 3 = 0 . df, and D* givtO*£c 6 q $/ and c 6 *0 • d ZJ andgive 

0= 6 c g Sjj and = O L Similarly d 3f d Zf d t/ and D 9f Z \j D u 

give c a * 1 . d 3! d 3Z and D §j (QfQ D n ) give c 7 ^ - 1 . d^ t d 3? and 

c s = 0 . d z ,d 4Z and D z/ (.D 4Z -D g , D„ ~D* J2) 
give Q= - 1. d„ d sz and D„ (D sz -D 4/ D„ -D M D a ,) give c z *-l. 

nd D 63 -D sz q r n si Z> z -D^-D^ 
+2D 3 I D zi D ii +D 3 2) /Z give c, = 2. Hence (321.1 3 ) =-Zs 63 - 


%2 5 // 


S 4Z 3 Zi 


3 32 S 3I * 3 3t 3 2/ S n ' 


Suppose that y^ = i, k=l,2, -, r?. Then s;j is simply s/ 

and j have no meaning except when L-j and then J>n and 
du become the operators Z>; and <2/ respectively, of the earlier 
paper. 

The operator relations for any number of sets of correspond¬ 
ing variates are now obvious. For instance, in the case of 3 sets 
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a£ 

y ( , xi the result is 

. , h, * ^ 

n... -X iiirkj. izj^z i- 3 J 3 k 3 _ 

^7 7^7 h;i ■ • ••' 

i = 2,-?,., M'V • 

J * 1,2, . 

* = 2,-?,-, iv„ 

h,i, + h z i z *h 3 i 3 + .-2, 

hj,*h»k + h 3h* -* 2» 

h,k,+ hgkg*h 3 k 3 + - • • • =/r, 

h,l 2 . L 3> ■‘j,JgJ3'' '> k r k *’ k *' "'’ h >' h * h 3- 

being positive integers , w p u^,w^the weights inxy-z respective¬ 
ly of the symmetric function , 

d iJk =b/bs ijk , s iJk = Z^ xiyjxf ■ 

Returning now to the case of two variates as and y, suppose 
that as/ takes on only integral values for /* 1,2, .... , 7 ?and that 
y/ *■ /^V^where f fxi) is thought of as the frequency cor¬ 
responding to x *■ %/ If, further, b f ** b z ~ b 3 * • • - 1 

then the operators developed in this note give the expressions for 
(p* b ^c ^■ Oof the earlier paper when each serial X^is re¬ 
placed by More generally, let y/ * f(x-^)A Xi represent 

the frequency of x in the interval AX£ . If x, takes on only in¬ 
tegral values then of course * 1 ,4-1 ,2, . . . . , ., 77 . 

Consider 


(32.11)=£ x 3 t xjf(xi)f( xj)Ax l A xj , L*j, 

ft 


*ZL xjf(*i)A x i . 47 xj f(xj)A x- -Z xf f*fe;)A xf. 
4=i J-l i*l 


If the lower and upper bounds for x are respectively <2 and b then 
in the limit as ??becomes infinite and the maximum Axi % i*l 9 Zy,r) 
approaches zero, the last summation on the right approaches zero, 
ffx) being an ordinary frequency function. Thus in this limiting 


case 
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•11) ~ f f x 3 i Xj *i) f(Xj)dx; dxj, itj, 

^CL JQ, 

r & 3 r t> z 

« / %i ffxOdxi * / xj f(xj) dxj . 

Jq. Jcl 


77 Is f * 

In general, under these limiting conditions, any summation £ x,;f 

r • ■ r t L ~ 1 

(x L )A xi approaches zero, if r is greater than 1, k and r being 
positive integers. For let € v be the maximum A x k for specified 
77. Now x k f r (x t ) i Miox i- 1,2., 77 . Hence 

£ x i k f r (x i )A^ r l i m£ A~x r L iMC r ^i Ax l * Me£%-a) 
U1 L~ 1 


which approaches zero with <5^ * This establishes the well known 
statement that, the values of ^ being independent, 




f ( 


-ffxijdx; dx ; - dx 

L k 4/ Lp c k 






For, under the above limiting conditions, all those terms which 
contain a sum -J^xpf^xi) A x L with k greater than 1 must 
vanish. By the last property of the D operators given in (2) it is 
seen that there is always only one term which does not contain such 
an ; and from this term arises the product of the definite in¬ 
tegrals. 




A GENERALIZED ERROR FUNCTION* 


By 

Albert Wertheimer 
I. INTRODUCTION 

Given a set of observed values {[ ( / * 1, 2, 3, . . . , n .) 
obtained from n observations assumed to be made on the same 
quantity, /, under the same conditions. We seek to determine 
two functions f(P t //) and cp(P f //^such that 


( / *1,2,3, ... , 77 ) 

defines /?asa unique value assigned to the observed quantity; and 
<$>(£? // )dti gives to within infinitesimals of higher order the 
probability that if another observation is made, the observed value 
will lie in the interval 

JiH ± -ti+dtL. 

Gauss determined the (p function to be the so-called normal 
error law namely, 

, -h z (P-Si) z 
J * ce 

on the basis of the following assumptions. 

(a) The product to be a maximum with respect to p. 

Thus 



Zip tog 4>(P,tJ-0, 
£ 3^2 tog <p(FH,no. 


♦Presented to the American Mathematical Society, December 28, 1931. 
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(b) The unique value p is the arithmetic mean of the observa¬ 
tions. Thus 

(c) The probability function is a function of (P- Thus 

<p(p, V s <PO=>-*i)- 

Poincare? on the basis of the first two assumptions only ob¬ 
tained the error function 


where 




vs(Vfp) 


dW 

dp 


+ p 


dY 

dp 


•o. 


In this paper we assume the unique value p to be defined by 
a function satisfying certain conditions, and obtain on the basis of 
assumption (a) a more general error function from which the so- 
called normal error law, the Poincare function, and other forms 
of the error function as well as the Pearson curves are obtained 
as special cases. 


2 . The unique value p. 

We now make the following assumptions: 

P The unique value p is defined explicitly as a function of the 
observed values in the region a 6. <b . Thus 
P~r(t h i z J 3 , * • , 

where F is single valued, continuous with continuous deriva¬ 
tives up to the second order. 

IP The value of p is independent of the order in which the obser¬ 
vations are obtained. Thus F is a symmetric function. 

Ill: The change in p due to a change in one of the observed values, 
say t is a function of p and ^ only. Thus 

o. ■ 0, (F - *<>■ 


1 H. Poincare, Criicitl des Probabilites (1912), p. 171. 
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IV ; If p is regarded as a function of a single variable, say 4' > 
while all the others are regarded as constants, then with re¬ 
spect to this variable p is a monotonic function and is not 
constant in any portion of the interval in which it is defined. 
Thus 


Fj *0 


for all i *s. 

We have then for the determination of the (j) function the 
two equations 

(1) fjp&tfteV-o. 

( 2 ) O, 

which must be simultaneously satisfied for any set of values in 
the region defined. 


3. The g function. 

We will now show by means of the following theorems that 
if F satisfies the given conditions, then there exists a unique func¬ 
tion g(P /j)$uch that the equation 

Lg(P, 

is identical with equation (2). 


THEOREM I. 


Given a function of r? variables, 


continuous with continuous non-vanishing first derivatives in the 
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region defined, such that 

then ijf (F, x. c ) must be in the form of a product of a function 
of F and a function of ~ti. Thus 

t i (F,x L )~co (F') / 6 V* V- 


Proof: 
We have 


F x *xJ « yrj (F, x l ) if J (F x J ), 


and 

FxU 1 -iri(F,x i )ir i (F,x i ). 

Hence: 

j^CP xO _ t F J CF,x J ) _ _ jr^fF.x V 

if l (F, x l ) \tr J fF,x J ) 77 

Integrating, we get 

{oglJf^F.x^sJ:r^(P)dF ^^(x 1 ). 

from which it follows that 

^(Fx 1 )* co(F)/3 i (x i ). 

THEOREM II. 

Given a -function of w variables, 

Fix'xfxf - ■ x”), 

continuous with continuous non-vanishing first derivatives in the 
region defined, then in order that there shall exist a unique func¬ 
tion T. (F) such that 
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£CF p ) = Zu i (x l ), 


it is necessary and sufficient that 

^ =<Yx‘) J(x } ). 

Proof : 

Necessary conditions :— 

If the £ function exists then the functional matrix 




£ 

u U 

rj 

' * 7 


r x ‘ 

P x * 

F x 3 ■ ■ 

. .f? n 

r z 

11 

must be of rank 

one. 

Hence 



f*x L CC L x i 


= - 7 - = << L (X, L ) <=<r J (x J ). 

F x s u U 

Sufficient conditions :— 


We assume that 




Then we have the following identities: 

d z » F - 


■ £<>9 F, S °' 


d x‘ d »j y F J 


b) — - Lit- 

Fj ~ F Jk - Fjki 

ioxk, 4 c or j and 


c )V/\r«c F c - r tt M - F i fP’ijk F i - Puj p k ). 

where for convenience of notation, 


s f 7 *.*> ^ik"~ 1 x** 


etc. 
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Making use of a), b), and c), it is easily shown that the 
functional matrix 



is of rank one. It follows that 


d) fir*™ 

Now the differential equation that defines the £ function is 

I.J ■ l„ Wi+t, Fij -O, 


or 



p. p. 9 

1 J --(K (P) from d). 


Hence f (F) is uniquely determined, namely, 
l (F)=kJ e ~ fA(F)aF dP+ H. 

where A' and H are constants of integration. 

Now, for our problem, if F satisfies the given conditions, 
we can apply the two theorems in succession and we have that 
there exists a unique function 

1(F)= 


But due to the symmetry of F all the u L functions will be the 
same and we have 

1(F) = 


If we now define 

gCp,4)»4it(p)-*(*t>- 
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we have 

$ g(p. *i)~Z(p)-Z(r)~o. 

4. General Error Function 

We may now write equations (1) and (2) in the form re¬ 
spectively 

£ ~§p togtPCp, t-)=o, 

£ g(pJc)~0. 

These equations must be simultaneously satisfied for an arbitrary 
set of values t; in the region defined. It follows that they are 
identical. Thus 

fpt°9 <PCp, { t ) = ]JrCp)gCp, t t ), 

where If ip) is an arbitrary function. 

Integrating, we get 

(3) e.) - erti) e / '^p)gCp, ti)dp 

where &///) is an arbitrary function. This is our general error 
function. In order to insure a maximum we must have 

( 4 ) t(p)g P i O. 

5. A Generalized Normal Function 
If we now make the additional assumption that 

ft)- <P\g(P, -ti)}, 

*6 * O 


we have 

% 
9 P 



A GENERALIZED ERROR FUNCTION 


n 


Expanding and simplifying, we get 

/<■) 

e <9t ~ 9p 



Differentiating with respect to 2/, we get 

/ Of . \_ 

9 -ii df i Wv 9p ~ H - 
Integrating and substituting in (3), we get 
<pfg)= ce kg . 

From (4) we have 

f<9p>o, 

Hence 

( 5 ) (p(g)=ce~ h<3 . 

We shall refer to this function as the “Generalized Normal Error 
Function”. 


5. Application to Special Cases 
If p is defined as the arithmetic mean, then the region con¬ 
sidered is - co < -£i < + <*>, and 


gCp, 


The normal law is obtained directly from (5), and from (4) we 
have 

<p(p,i i ).e« i )c ; + (r,>Cp - f ‘ :iap 
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where 


dW dV „ 
+P ^- = o, 
dp dp 


which is the same as the Poincare' function. 

For the geometric mean, the region considered is 


o< #<<*» 


and 


9(p, p-fog t L . 


Hence, from (3) 

0(p, 4 )*erti)e ; 


■h z {tog p - fog fi } A 


and from (4) 

(p[tog p- fog 4* } =ce 
The Geometric mean as the most probable value, as well as its 
generalized normal curve are used for certain astronomical photo¬ 
metric measurements. 1 

‘For the harmonic mean, the region considered is 
(O < < c* ) 


and 

gCp, 

Then from (3), we have , . 

and from (4), we have . % 

7. Remarks About the Generalized Normal Curves 
Let us consider briefly some characteristics of the generalized 
normal curves corresponding to the following three special cases. 


1 Whittaker & Robinson, Calculus of observations (1924), p. 218. 
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(a) Arithmetic mean: Here 

tpfpj)-ce 

From this equation we see that 

< P(p,P+€*')' , <P(p,p- 
<P(p, /) =<pfp+c {+£*), 

cp{ P) C>) -ce~ h * p \ 

@(pj °°) = O. 

(b) Harmonic mean: In this case 

from which we see that 

(P(p,p - £*)* <p( p, p * £ V 
(pfpj) < Q(p+C*, {*€*), 

<P(pO) - o, 

<J)(p. ao) - ce ~P z 

(c) Geometric Mean: Here 

<PCp,f) 

and 

(pCp,p- £*)< fl{pp+ € z ). 

(p(pO) *o, 

tffP; <*>) " 

Instead of treating these normal functions as three distinct 
error laws referred to the same measuring scale, we can regard 
them as a single error law with reference to three different measur- 
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ing scales (seelsketch). This viewpoint helps to explain the above 
mentioned characteristics of these laws. 

The law for the arithmetic mean applies when an object is 
measured with a uniformly graduated scale in p . The char¬ 
acteristics for this law follow directly from the consideration that 
the scale is everywhere the same. 

The law for the harmonic mean holds when an object is 
measured with a reciprocally graduated scale, as for instance 
measuring the volume of a gas with a pressure gauge graduated 
for volume. In this case the scale becomes crowded as p in¬ 
creases, and hence 


and also 

0(p,p+6*) > <P(p,p-€*). 

For large values of p it would take only a small error in the read¬ 
ing of the scale to make an infinitely large error in the value of p 
and hence (fiCp^w) does not necessarily vanish. On the other 
hand the zero point is at an infinite distance and hence $(p, 0)-0. 

The law for the geometric mean holds for measuring objects 
with a logarithmically graduated scale. The same remarks as for 
the harmonic mean apply here, except that in this case it would 
take an infinitely large error in the reading of the scale to make 
an infinitely large error in the value oip. Hence (p(o t oo)=0. 

8. The Pearson Curves 

Leaving out the subscripts in (3), we have for the general 
error function 

(pCpJ) ~Q({)e 


f&Cp)g(p> f) dp 
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we have 

d p}- 


Thus for a given pthe curve approaches the i axis asymptoticallj 
Let us now impose the condition that 


d(p 

di 



=0, 


then 

§*« u ( {)f^)dl 


Integrating, we get 


OfS)=ce 


fu(Z) {fi/r(E)df} d/ 

J 


so that 

/, -Mtj{Mt)df}df+f1r(p)gCp,t)dp 
( 6 ) (p(pj)^ce 

and 

| ^ r 0U,jfvr(t)dt, 

where t is a variable of integration. If we now take as a special 


u.^ * [b. + b,f* } , 


case 
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we have 


w.0 ez± ' 

d£ * b. +b t l + b;i* 9 


which is the differential equation defining the Pearson system of 
frequency curves. For this case, (6) reduces to 


ce 


f 




p-^ 


dt+i ft fa) dp 

t 


from which we see that by a proper choice of <f ip) we can 
choose the value of p for which the product 77 (p, ) 

shall be a maximum. 

It may be noted that the differential equation defining the 
Pearson curves is often derived on the basis of the assumptions 
that the curve shall approach the { axis asymptotically, and have 
only one maximum point. 

In conclusion, i + appears that if we restrict the function that 
defines to satisfy the assumptions given in this paper, and also 
impose the condition that/? shall be the most probable value in the 
sense, that the product 

shall be a maximum with respect to p, then (3) is the most general 
form of the error function. 
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Scales, corresponding to the arithmetic, geometric, and 
harmonic means. 


* r 


Arithmetic Mean — Uniform Scale 



Harmonic Mean — Reciprocal Scale 







A COEFFICIENT OF LINEAR CORRELATION 
BASED ON THE METHOD OF LEAST SQUARES 
AND THE LINE OF BEST FIT. 


By J. B. Coleman 


Given N points in a plane, corresponding to N pairs of values 
for two variables, X and Y , we find the line of best fit and 
the line of worst fit, by the method of least squares*. Then we 
derive a cofficient of correlation based on the sum of the squares 
of the distances of the points from these two lines. 

The line of best fit is in the line such that the sum of the 
squares of the distances of the points from it is a minimum. The 
line of worst fit is the one from which the sum of the squares of 
the distances of the points is a maximum. We shall refer to them, 
respectively, as the minimum and maximum lines. 

For convenience we take the origin at the centre of gravity 
of the points, letting x and y denote deviations of X and 
Y , respectively, from their arithmetic means. 

1. The two lines pass thru the arithmetic means of the X's , 

and of the Y’s . 


y= mn + b may represent any line of the plane. The dis¬ 
tance, d. i , of a point, ( x L , y L ) , from the line is 


y. - mx L - b 
JT+rrfz 


The sum of the squares of the distances of 


the N points from the line will be 


♦For a general discussion of this method of fitting when 2 vam- 
bles are involved, see, Pearson, Karl, “On Lines and Planes of Closest Fit 
to Systems of Points in Space”, Phil. Mag., 6th series, vol. ii, 1901, P. 559. 
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_ z Zy z + m z Zx 2 4- Nb z - 2 mZ xy-2bZy + 2mbZx 

<( 1 ) La- -;- 5 - 

v ’ l+m* 


Using Ex = Zy = O, in the condition for maximum or mini¬ 
mum values in (1), the condition reduces to J 2 -O , and the 
theorem follows. 

2 . To find the slopes of the two lines. 

Equation (1) now becomes 

(2) l/. Zy*-2 7rrjZky_ + Tr?fZxf_ _ 

1+Trj z 

The condition under which (2) will have a maximum or mini¬ 
mum values, is that 

m z Lxy+m (Zx 2 -Ly z ) -2T xy - O. 

This condition is satisfied by two values of rn , namely; 


(3) 


Zy Z -Lx z +j(Zx z -Zy z ) z -tdfExy) 1 
2Zxy 


(4) 


Zy*-Zx z - ](Zx*-Zy*)* + 4(Zxy) z 


2Zxy 


It is found by considering the second derivative that (4) is the 
condition under which Zd 2 will have a maximum value, and 
/ 3) is the condition for a minimum value. 
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The equation of the minimum line is y= m f x , and that 
of the maximum line is y-m 2 z, The value of m } and 
vn z are those given in (3) and (4). 

3. The maximum and minimum lines are perpendicular to each 
other. 

That 7n f - -i/777 g is easily shown from (3) and (4). 
Further m } has the same sign as Z xy , and m z , the 
opposite sign, since their numerators are, respectively, positive and 
negative. 

4. The minimum line lies between the two lines of regression, of 
coincides with them. 

If Zjxy > O, 

7T7 " 

1 ~ ZZxy 

since 2Z x z 2Zy z = (Exy) z . 

Hence tv, ^ ZC y z /Zxy ^ rn , the slope of the line of re- 
gression of ^ on y . 

Rationalizing the numerator of (3), and noting that £x 2 Zy 2 = 
(Zxy) z , we obtain 

^ ZHx y _ 

the slope of the line of regression of y on X . 

In the same way it may be shown that if ZL xy < O , then rr?^ i 
777, £ rn 

The condition that rvj be equal to the slope of one line of 
regression is the same that it be equal to the other, so that the 
minimum line coincides with both lines of regression, or else lies 
between the two. 
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5. To find the sum of the squares of the distances of the points 
from the minimum line; also from the maximum line. 


L<'t u be the distance of a point from the line y= m^x 



Hy^-Z m, H x.y+ m, z Zlx. z 

1*7r?* 


Substituting for m from (3) and reducing, we obtain 

(5) 

Ld z =\Zx z +Zy z - JcETx *-Xy z )*- + 4(Lxy) z ]/*, 


Similarly, if D represents the distance of a point from the 
line y = 777 ^ r , we obtain 

(6) I1D Z = \l2x z +Ey 2 +J(Lx z -Zy 2 ) z + 4(Z!zy)^/ z * 

6 . To find a coefficient of linear correlation. 

Let q -jHd Z /2ZP Z . Substituting from (5) and ( 6 ), 
and reducing, we obtain 


(7a) <?= 


Zx z +Ey 2 - 4CZxy) z 

ZJL x 2 22y 2 - {2Zxy) z 


or 


(7b) o. iJ EiMZIJEHL 

Zx z +Zy z i JfZ^ z -Zy z ) z 74(2vy) 2 


q represents the ratio of the root-mean-squares of the distances 
of the poirt from the minimum and maximum lines. This ratio 
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is a measure of the closeness of fit of the points to a line, and 
should furnish a measure of linear correlation. The value of q 
may vary from 0 to 1. q = 0 indicates that the points are 
all on a straight line, hence that the correlation is perfect. It is 
of interest to note that when q = 0 , (7b) gives Z 

C = Cl = - 1 • When q is 1 the mean squares of the distances 

of the points from two lines at right angles is the same, and linear 
correlation is lacking. Hence 1-q would furnish a coeffi¬ 
cient conforming to the customs that it have the value 1 for 
perfect correlation, and O for lack of correlation. 

Values of q found from (7a) or (7b) would involve the 
units in which X and Y are given. Hence these forms would 
be objectionable, in that q could be made to assume different 
values for the same data, by changing the units in which x and 
y are expressed. However, this objection may be removed by 
taking d x and <3" y as units in which to express X and Y . 
(7b) then reduces to 

q = I ' 

The coefficient 1-q may now be expressed as 


/v- \ZLxyMxCty 

J N^\Zxy/cX x &y\ 


The sign of the coefficient should agree with that of the slope 
of the line to which the points are fitted. Hence, when the value 
for q has been found it should be given the same sign as 
the slope of the minimum line. But the slope of the minimum 
line is determined by that of 2Z xy , so that the sign given to 
l-q should be that of Z!xy. 
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The coefficient 1-q may be expressed immediately in terms 
of the Pearson coefficient, r , which is equal to 22, vy/Ncr^dy 
Making this substitution in (8) we have 


C-/-9 -J- 


l+\r 


In the table are shown values oi 1-q corresponding to some 
given values of r . The values for 1 -q z have also been listed 
corresponding to the same set of values for r . The maximum 
difference occurs between t-<q and r when r* 839 and 1-q = 
704, a difference of .135 by which 1 - q is smaller. 


r 

({ 

l 

1- 

i 

1 

1 

.99 

.929 

995 

.95 

840 

.974 

.9 

.771 

.947 

.839 

704 

.912 

.8 

.667 

.889 

.7 

.580 

.824 

.6 

.500 

.750 

.5 

.423 

.667 

.4 

.345 

.571 

.3 

.266 

.462 

.2 

.183 

.333 

.1 

.095 

.182 

0 

0 

0 
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A STUDY OF THE DISTRIBUTION OF MEANS ES- 
TIMATED FROM SMALL SAMPLES BY THE 
METHOD OF MAXIMUM LIKELIHOOD 
FOR PEARSON’S TYPE II CURVE 


By John L. Carlson 

The object of this paper is to study the distribution of esti¬ 
mates of the parameter of location for Pearson’s Type II Curve, 
estimated by the method of maximum likelihood from small 
samples. 

R. A . Fisher has assumed, 1 and Professor Hotelling has 
proved 2 that in large categories of cases the distribution of an 
optimum statistic approaches normality as the sample size in¬ 
creases. This normality has been assumed to hold for optimum 
statistics in general whether calculated from large samples or 
small ones, and it has also been assumed that optimum statistics 
have minimum variance and always give better fits than do sta¬ 
tistics calculated by the method of moments. That this is the case 
whenever the sample is large and the distribution of optimum 
statistics normal is made plausible by the reasoning of R. A. 
Fisher. 3 In case the sample is small, however, there may be reason 
to doubt that the normality of distribution of optimum statistics 
holds, and that the other conclusions hold. It is with this phase of 
the subject that we shall be concerned in what follows. 

Before entering into the topic under discussion it will be con¬ 
venient to review some of the more elementary facts regarding the 
curve with which we are to be concerned. We shall take first the 
general equation for the curve in the form, 


(i) 




fx-rr?)* 

a* J 


x On the Mathematical Foundations of Theoretical Statistics, R. A. 
Fisher, Phil. Trans. Series A, Vol. 222, 1922. ?p. 309-368. 

2 The Consistency and Ultimate Distribution of Optimum Statistics. 
Harold Hotelling, Trans. Amer. Math. Soc. Vol. 32, No* 4. Pp* 847-859. 
s Ibid, Pp. 328-368. 
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and determine the effect of variation of the constants. 

When p=0 the equation reduces to the straight line y =y 

When p = + 1 the equation is that of a parabola with 
y « y e at the point x ^ rn and the ^ intercepts at the points 
x * i a . It of course meets the x axis at an angle. 

When p - + Z the equation y=0 is of the fourth degree 
in ^ with double roots at the points x * t a. 

In general when p= r? the equation y=0 is of degree 
277 in x and has two sets of n -fold multiple roots x=± a. 

Since our curve is to he a probability curve it will of neces¬ 
sity have unit area, and this fact makes it possible for us to 
evaluate y 0 in terms of the parameters a and p . In order to 
do this we shall perform the integration below. 


Area 


/ x+a 

y dx : 

jo- a 




¥ 

cos 


2p+1 8 dd 


whence 


but now since 


( 2 ) 


/' 


77 

Z 

COS 


„„ „ Jn rr-°&) 

6dd= z 


we have 


/ =2ayd C1 e±LL 

y ° 2 r(p^) 


v rCp^il 

° ajn rfp-t-1) 


and so 
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Therefore 


(3) 


y- 


asfrr r(p+1) 



, p 
fy-m) 

a * 


Formula (2) we shall find to be of value a number of time 
and formula (3) is the form in which equation (1) will be use< 
throughout the remainder of the paper. 

It will be worth while now to consider the likelihood functioi 
JL together with its first and second partial derivatives with re 
spect to 77? . (We shall hereafter refer to rn as the parameter 
of location, a as the parameter of scaling, and p as the para 
meter of shape.) We are to use 777 to denote the estimate of m 
obtained by the method of maximum likelihood, in accordant 
with the convention introduced by Fisher, 1 and it will be with thi; 
parameter that we shall concern ourselves in this investigation. 

We have from (3) on the preceding page 


(4) 






and so 


(5) 


bL . 

5n? 


n 

zpZ, 

1=1 


a z -('x i -m)z 


and 


( 6 ) 


77 


$-L =-z P Z 




ilbid Pp. 309-368. 
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At tin's point we shall stop to consider the effect of variation 
of the parameters a and p upon our estimate of 777. Let us 
first consider p . Since the method of maximum likelihood is 
here merely the method of the diffeientiai calculus it follows from 
a consideration of equation ( 5 ) that our estimate of 777 will be 
independent of p . for any particular sample. Such is not the 
case when we consider a . however, for any change in a allows 
a change in the variance of 777 for the particular sample. 

We shall find it advantageous to cover as much of the the* 
oretical work as possible before embarking upon our experimental 
check and its great amount of numeral calculations, for it is only 
by means of a check between theory and experiment that we are 
able in the present state of knowledge, to judge of the applicabil* 
ity of the method of maximum likelihood to small samples. Of 
course it will be necessary to consider the distribution of our 
estimates of 777 , in order to make this statistic of practical use, 
and it is desirable to know the theoretical variances of % the 
arithmetic mean, 777, and the experimentally obtained variance of 
the distribution of our estimates of 777. The first of these we can 
obtain from theory, the second from an approximation valid only 
in the limit, and the last by means of calculation based on actual 
sampling. 

We shall be concerned first with the theoretical variance of 
x . It is well known that the variance of x is equal to the 
variance of the distribution divided by rj . We must first, there¬ 
fore, find the variance of the distribution. 

From ( 3 ) page 88 ,it follows that 


2Cf pt *2 - jx z (l-%z) dx 

a aJTrr(p+/) J Q 


*- 1%^‘e -ccs^e\dS 




ajffrfp+f) 


\C05 e - cos 0 \ 
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remembering now (2) page t 87 
2 


z & 
a * 


2.p+3 

whence it follows that 


( 7 ) 


" 77 ri(2/o+3) 


We shall no-w calculate the limiting form of the variance of 
777. Fisher has proved 1 that if the distribution of optimum sta¬ 
tistics is normal the variance of an optimum statistic is equal to 
the negative reciprocal of the mathematical expectation of the 
second partial derivative of the logarithm of the likelihood with 
respect to the parameter in question. 

We may write, therefore 

1 = ~ 4 T7jD rfp+i) f a (a 2 +z z ) 

C% tzjrrr(jo+1) J a 



_ -4 np rfp+jr ) 

~Trr ajnrfp+l) J o 
and so again referring to (2) page 87 we have 


OjL 


(£cos 


2p-3 

&- cas 


whence 

■ w e)dg 


_1_ 2 rtp {o+j;) 

a s fo-l) 

hence we have 


( 8 ) 


r* a z (p-D 

m -np(2p+l) 


The efficiency of the mean is then 

( 9 ) E- 




p(2p+l) 


<3? 


The next problem with which we must concern ourselves is' 
that of experimental verification of the assumptions under discus* 


1 Ibid Pp. 327-328. 
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sion. The problem is briefly that of choosing a number of small 
samples from a population which obeys our law of frequency, 
estimating 777 for these samples, and then calculating the va riant 
for the distribution. Also we shall draw an histogram of the dis¬ 
tribution and observe the general type of the distribution, so near¬ 
ly as that is possible from our samples. 

The problem of choosing our samples is not the least of our 
difficulties, for we can not take all types of samples. They must 
be of a very special nature: they must be from a population of the 
Type II. In order to accomplish this it will be necessary to have 
a table of areas corresponding to given values of x for the Type 
II Curve. There is no such table available to the knowledge of 
the writer, and it is therefore necessary to construct the table be¬ 
fore we can procede with the choosing of the samples. After the 
table has been built, we can with the aid of Tippett’s Tables, 1 
choose our samples with ease. The manner of choosing is as fol¬ 
lows. Take the numbers from Tippett’s Tables as areas under 
the Type II Curve and look up in the table of areas the values of 
X corresponding to the smallest area containing the area found 
from the Random Numbers. This will give the value of x to be 
taken. Since we will take four digits let the fifth digit determine 
the sign. If it is odd take the sign - , if it is even take the sign + . 

There are two ways in which a table of this nature can be 
prepared, and the method employed must in any case be de¬ 
termined by the degree of accuracy attainable and the amount of 
labor involved. One of these methods is that of the calculus of 
finite differences, determining the zero order differences by means 
of algebra, and from these by the process of addition building up 
the table. This method is best used when a dependable listing 
adding machine is at hand. The other method is that of direct 
integration, and it is found that with the aid of two calculating 
machines this is by far the quicker. It was this method that was 
applied in the building of the table on page 92 and 93. 


^Tracts For Computers, No. XV. 
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Table of Areas Under Pearson’s Type II Curve, Correct tb 
9 Places of Decimals. The Areas are included between ordinates 
located - x units from the parameter of location. 

Constants. y a = p*2, and a-1 


X 

Area 


X 

Area 

0.00 

0.000 000 000 


30 

529 661 250 

01 

018 748 750 


31 

545 084 843 

02 

037 490 001 


32 

560 298 291 

03 

056 212 259 


33 

575 295 077 

04 

074 920 038 


34 

590 073 828 

05 

093 593 867 


35 

604 625 820 

06 

112 230 292 


36 

618 947 482 

07 

130 821 880 


37 

633 034 148 

08 

149 361 229 


38 

646 881 319 

09 

167 840 964 


/39 

660 484 657 

10 

186 253 750 


40 

673 840 000 

11 

204 592 289 



686 943 358 

12 

222 849 331 



699 790 921 

13 

241 017 673 



712 379 067 

14 

259 090 168 


44 

724 704 358 

15 

277 059 727 


45 

736 763 555 

16 

294 919 322 


46 

748 553 612 

17 

312 661 995 


47 

760 071 688 

18 

330 280 859 


48 

773 151 488 

19 

347 769 104 


49 

782 281 572 

20 

365 120 040 


50 

792 968 750 

21 

382 326 904 


51 

803 374 696 

22 

399 383 261 


52 

813 497 651 

23 

416 282 613 


53 

823 336 081 

24 

433 018 598 


54 

832 888 688 

25 

449 584 961 


55 

842 154 414 

26 

465 975 552 


56 

851 132 442 

27 

482 184 334 


57 

860 009 702 

28 

498 205 389 


58 

868 223 379 

29 

514 032 918 


59 

876 335 911 

W~ 



\WKKKKSiM 

884 160 000 
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The table on the preceding page was built by direct integration 
of (3) page 88, with and <z=l These values for the 

parameters were chosen so as to save as much labor in calculation 
as possible, and at the same time maintain the desired shape of the 
curve. There has of course been % no less of generality in setting 
a* l but we have limited ourselves quite definitely in using the 
value p=2 . The accuracy of the table to 9 places of decimals 
has been assured by calculating all values to 13 places and then 
determining the maximum error over the whole range which was 
625 in the 13 place due to the use of the decimal equivalent of z /? 
in the third degree term. 

Our table of areas being complete and the problem of sampling 
thus solved, we must next consider* the task of estimating 777 . 
Since we have already taken a * 1 and p ~2 in building our 
tables we shall continue to use these values throughout the work. 
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The next question to be settled before beginning our work is the 
size of the samples with which we are to deal. The case 77 = 1 is 
of course of no interest for the best estimate of a single observa¬ 
tion is the observation itself. The case 77 - 2 is likewise of no 
interest for in this case the arithmetic mean coincides with the solu¬ 
tion by the method of maximum likelihood. Therefore it is the 
case 77 * 3 with which we shall be concerned. Our results are 
not then trivial, and at the same time we have the case which is the 
easiest to deal with, since the number of numerical calculations for 
each sample is reduced to the minimum. 

Before going ahead with any attempt at solving an equation 
such as (5) pageiSo, it is well to have in mind a picture of what we 
are actually trying to accomplish. With such a picture in mind we 
are better able to realize the difficulties of the situation and so are 
better able to cope with them. To this end we have included a 
graph of the problem involved w T hich will make clear at a glance 
just what must be done to find the true value of 777. 

We have drawn, page 96 , from plotted points the curves 

representing L , and the three terms of . Also we 

have drawn in the asymptotes to the curve which are of significance. 
If we are able to find the point at which =O we have the 

O 771 

solution to our problem. This involves solving a fifth degree equa¬ 
tion. We shall use Newton's method of successive approximation. 
Fisher states 1 that in some cases at least, we may start with an in¬ 
efficient statistic and by a single approximation obtain an efficient 
one. Whether or not this is the case for small samples we shall 
see when our calculations have been analyzed. 

It will be seen upon examining the graph that as is allowed 

to vary each of the therms of varies from -a 0 to -/■ a o , 

1 Theory of Statistical Estimation, R. A. Fisher, Proc. Cam. Phil. Soc. 
Vol. XXII part 5. Pp. 708-709. 
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and so their sum also varies between these limits. It will be seen 
that the asymptote corresponding to the largest allowable value of 
37can be found by adding the value of a to the smallest observation, 
and that the asymptote corresponding to the smallest allowable 
value of m may be found by subtracting the value of a from the 
greatest observation. It is thus evident that as dispersion in the 
sample increases the variance of m must surely decrease. That this 
fact is of fundamental importance in choosing our first estimate 
of m will be seen from consideration of the following case, since 
it is well known that in the case of a curve with a real finite pole 
Newton’s method may lead us to erroneous results. Let us con¬ 
sider the sample consisting of the three observations x,-- ,99, 
s + .99, ^**.99, for which the arithmetic mean is 

2 = v- .33. If now we take as our first approximation m f = 2 
we will immediately lead ourselves to an erroneous value of m 2 , 
our estimate of rh . That this is the case will be easily seen by 
means of the check given on page 94 for using a - 1 we locate 
the asymptotes at t .01. The true value of m located be¬ 
tween these asymptotes, is not therefore even to be approached 
should we take m f = X . This is an extreme case and fortunate¬ 
ly not to be anticipated very often, or the arithmetic mean would 
he deprived of any value whatsoever. We should always be sure 
that the difference between any observation and the value of m 
that we are using is not greater than the value of a when dealing 
with the Type II Curve. This holds no matter what our manner 
of attack may be. 

We shall now be concerned with the calculations for our 100 
samples of 3. All of the data are tabulated in such a way as to be 
.self-explanatory, and so we shall not bother to give sample cal¬ 
culations. The next ten pages cover these calculations. The dis¬ 
cussion is continued on page 104. 
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^L 

77?,=% 

/ 3 

2 WZ? 

$L 

dm 

| Xj-rn 

i+(x,, -m) 2 

dW* 

| /Vf^-7 Vf 

m z 

d*L m dL 

WJdnF 

msSL11 

mam 

mm. 

mS&SSm 

1 

U7 


+ .254667 





2 

-23 

-.0700 

- .164200 

+ .009840 



BIIwwl 

3 

-.15 


- .080520 





1 

SR1 


- .083882 





2 

-.15 

+.0S33 

- .212064 

f .016730 

BKMEEB 

- 3.438830 

+.049737 

3 

+.34 


+ .312676 


-1284716 



1 

+.67 


+ 254668 


- 1.190833 



2 

-.11 

+.4300 

- .762281 

-.177943 

-2.573777 

- 5.080876 

+.465022 

3 

+.73 


+ .329670 


-1.316266 



1 

-.54 


- .291230 


-1.248262 



2 

-.33 

-2700 

-.060216 

+.018882 


-3.655622 

-264835 

3 

+.06 


+ .370328 


-1.396495 



1 

-41 


-.481688 


-1.718538 



l 

+ 11 

+.0067 

+ .104445 

-.029803 


- 4.101439 

+.013936 

3 

+32 


+ .347440 


-1.350292 



9! 

BBS| 


-.408701 


-1.479856 



9 

SrJ 

+.0967 

+.023312 

-.024353 

BIM! 

-3.858903 

+.103011 

9^ 

HUM 

JHI 

+.361036 


-1.377417 



1 

S3 


4.770451 


-4.424677 



2 

51 


- .308021 

+.853498 

-4277016 

- 9.978709 

-.131268 

3 



-.609932 


-1.277016 



n 

DO 


+ .815850 





2 

-64 

-.3700 

- .291230 

+207992 

-1.248262 

-4.758692 

-.326292 

3 

-.66 


-.316628 


-1.292329 



T 

-.12 


+ .361036 


-1.377433 



2 

-.48 

-.4433 

- .036749 

+ .011911 


- 3.666198 

-.440051 

3 

-.73 


- .312376 


- 1.284716 



T 

--49 


+ .167774 





2 

-.77 

-.6533 

- .118311 

+ .002661 

mTfrm 



3 

jgj] 


- .046802 





r 

S3 


+ .066896 


BEBSSi 



2 

HSl 

-.1367 

+ .182285 

-.009546 

-1.098764 

-3308885 

-.139555 

3 

-.38 


- .258727 


gggggl 



r~ 

ISRil 


+ .128735 


BESSEl 



2 

-.72 

-.5267 

- .200836 

-.005134 

-1.119217 

-3.182127 

-.528283 

3 

-.46 


+ .066967 


BIQfcW 



1 

933 


- .036749 


IBISEBEEl 



2 

-.65 

-.2633 

- .454723 

-.387058 

-1.589322 

-4.343556 

-.352411 

3 

+.16 


+ .104414 





~ 

warn 

■ 

+ .128767 





2 


-.2667 

-246729 

-.010034 

-1.179312 

-3363601 

-269775 

3 

2H 


+ .107928 


mmam 
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1 



777, =X 

zf V 

n 

dL 

J.m 

3 

r *l-™ x 

l+te L - 

d z L 

dm 2 

w z 

/-/%' 77?) 2 

%/ h(*i 




HI 

1 

m 


+ .149926 


- 1.066950 



15 

2 


-.2367 

+ .036749 

+ .002713 

- 1.004049 

- 3.177717 

- 235846 


3 

-.43 


- .183962 


- 1.106718 




1 

-.29 


- .360623 


- 1.376579 



16 

2 

+.12 

+.0333 

+ .0 87663 

- .021856 

-1.022996 

- 3.585193 

+.039096 


3 

+.27 


+ .251104 


-1.185618 




1 

SEI 


+ .142798 


-1.060774 

■HUH 

mu 

17 

2 

-.39 

-.2400 

- .153452 

-.000653 

-1.070113 




3 

-.23 


+ .010001 


-1.000300 

■hmh 

mm 


1 

+ 26 


+ .428687 


-1.526159 


■ 

18 

2 

-.42 

-.1100 

- .342958 

+ .025513 

-1.341557 

- 3.878581 



3 

-.17 


- .060216 


- 1.010865 


EH 


1 

+ .23 


+ .423670 


- 1.514353 



19 

2 

-.21 

-.1367 

- .073695 

+ .029076 

- 1.016264 




3 

-.43 


- .320905 


- 1.300082 



■1 

1 

+ .37 


+ .053552 


- 1.008563 


■ 

Wfch 

2 

-.09 

+ .3167 

- .483174 

- .025787 

- 1.673112 

- 4.150227 


■ 

3 

+ .67 


+ .403835 


- 1468552 




a 

+ .21 


+ .178665 


- 1094805 


■ 


u 

-.76 

+ .0367 

-2.181131 

- .983165 

-12.252377 

-17.060464 



a 

+ .66 




-3.713282 | 


mm 


i 

K3 


- .139303 


-1.057853 



22 

2 

♦ .23 

-.0633 

+ .320905 

+ .020958 

-1.300082 

- 3.434721 

mmi&m 


3 

-.22 


- .160644 


-1.076786 




1 

-.44 


- .142798 


- 1.060744 



23 

2 

-.62 

-.3000 

- .356506 

+ .084156 

-1.368375 

- 4.378292 

-.280779 


3 

+ .16 


+ .583460 


-1.949243 




1 

-.28 


- .481639 


-1.658197 


■mi 


2 

+.20 

+.1233 

+ .077153 

- .038750 | 

-1.017823 

- 4.063031 



3 

+ .45 


+ .365736 


-1.387011 


mu 

—ri 

1 

-.13 


+ .107928 


-1.034813 


■ 

2S 

2 

-.56 

-.2367 

- .361036 

+• .155433 

-1.377417 

- 3.564903 



3 

mm 


+ .408551 


-1.152673 


mu 


n 

BEI 


+ .111347 


-1.024632 


mm i 

26 

H 

!9kcl 

-.0100 

+ .153452 

- .014051 

-1.070113 

- 3.322761 



H 

3s2| 


- 278850 


-1228016 


mm 


1 

-.79 


- .603260 


-2.011377 


PH 

27 

2 

[gi 

-.3200 | 

+ 242846 

- .105746 

-1.142660 

. 4.344870 



3 

Syj 


4 * 254668 


—1.190833 


mam 


1 

-.08 


~ .273953 


-1.028078 



28 

2 

+ .43 

+.1760 
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DISTRIBUTION OF MEANS 
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DISTRIBUTION OF MBANS 


In order to analyze the results of the calculations on the pre¬ 
ceding ten pages it is necessary that we find the theoretical vari¬ 
ance for £ and w from the formulae derived for this purpose 
*lier in this paper. 

Prom (7) page 1 90,we get after setting a= 1, p = 2, and 77*3, 

< 10 ) ^ = ^-=.047619 

and from (8) page 90 after similar substitutions 

(11) 033333 

Prom (9) page 90 we get for the efficiency of the mean when 

(12 ) p=2 £*.70 

Now by actual calculation from the ungrouped data the mean 
square deviation from zero which we shall designate as 

23x/^ 

(13) — - .048611 
and 

(14) 047612 

Comparing (10) and (13) it is evident that such a difference 
can be said to be well within the limits of random sampling. The 
difference between (11) and (14) is of such magnitude that we 
can not say that it might be expected in the course of random 
sampling. Now since T77 z is our estimate of m it is evident that 
either a single approximation by Newton’s method is not adequate 
to give the best results or the approximation, 

£ a z (&-l) 

* r)p(2p+l) 
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to the variance is not valid in the case of small samples. That the 
latter seems to be the case the writer firmly believes. The reason 
for this belief lies in the fact that in a subsequent case a sample of 
3 was examined by Newton’s method, and starting with m, 
= *■- - .07 the values m z = - .066879 and m 3 = - .066791 

were obtained. There is not sufficient improvement here to cause 
one to suppose that by taking a third approximation we would 
obtain a variance in keeping with the one derived from theory. 
Also considering the mean square deviation from zero for 5? and 
m z it seems that the gain in accuracy to be expected from the use 
of the method of maximum likelihood solution instead of the arith¬ 
metic mean is not sufficiently great, in the case of samples of three, 
to warrant the additional labor involved in calculation. We must 
be sure, however, that in using the arithmetic mean we are using 
an approximation to m which complies with the qualifications given 
on pages 95 and 96. A graph of the distribution as found from 
the calculations is given on page 106. The histogram represents 
the grouped data while the smoothed curve is a rough approxima¬ 
tion to the actual form of the distribution. 


Histogram Data 
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The totals have been used in the histogram which has been 
forced to be symmetrical so as to give the effect of a s amp le of 
twice the size. 

The tabulation of the histogram data draws attention to the 
great excess of samples having negative values for x and m z . 
This has caused the writer no little concern. In examining the 
signs of the observations we note that there are 183 - and only 
117+ values. Assuming + and - values to be equally likely 
this gives a deviation from 150 of 33 or 3.81 times its standard 
error which is incredible. It seems therefore that Tippett’s num¬ 
bers are not random in this respect, and that it perhaps would have 
been better to toss a coin to determine the signs. 

As a final check and in an effort to place the type of the dis¬ 
tribution the value of 0 Z has been calculated, and found to be 
4=3.056 

t3, was not calculated as the excess of negative signs would lead 
to an erroneous value. There is every reason to believe that 
should be zero. These facts suggest that the curve is very near to 
the normal curve, but perhaps slightly more leptokurtic. But, 
why, if this is the case, there is not better agreement between (11) 
and (14) page 104 the writer is unable at the present to state. 


A NEW THEORY OF DEPRECIATION OF 
PHYSICAL ASSETS 

By Robert E. Moritz 

1 . The various methods employed for the computation of 
the depreciation of a physical asset are as many devices for re¬ 
covering, by means of a yearly charge to production during the 
life-time of the asset, its reduction in value. The methods differ 
according to the answers given to such questions as the following: 
Should the yearly charge be based on original cost or on replace¬ 
ment value? Should the yearly charge be uniform over the life¬ 
time of the asset? If not uniform, should the depreciation charge 
be proportional to the actual reduction in market value, or, in the 
case of new plant, should such charges be minimized or wholely 
deferred during the earlier years when the plant is trying to estab¬ 
lish itself? Should interest be disregarded, or should the yearly 
charges to production be accumulated with interest? If interest 
is to be considered, should the rate be the effective rate on the 
capital employed in the business, or the commercial rate? All 
these questions and others have received careful consideration. 1 

Quite as important as those already mentioned are two other 
considerations, which have been generally ignored or overlooked. 
There is first the rather obvious fact that depreciation is insepar¬ 
ably tied up with the question of repairs. Depreciation is greatest 
when the asset is new, when repair charges are negligible, and it 
diminishes as repair costs grow. The problem of depreciation, 
therefore, cannot be adequately treated aside from repairs. Within 
certain limits depreciation may be compensated by repairs. The 
yearly depreciation charge to production, therefore, should not be 
based on original cost or renewal cost alone, but on original cost 
plus costs of repairs during the life-time of the asset. 

In the second place, the life-time of an asset is not a constant 

1 See Saliers, E. A., Depreciation, Principles and Applications, New 
York (1922). 
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as is generally assumed, but is a variable which, like depreciation 
itself, is definitely related to the repair function. Aside from ob¬ 
solescence, the value of an asset could be kept practically intact 
indefinitely by sufficiently increasing the outlay for repairs. There 
is always a> threshold period of time when it is a question whether 
to scrap or to continue to repair, and frequently this threshold 
extends over a period of years. In short, the life-time of an asset 
is generally an unknown quantity, the determination of which re¬ 
quires the solution of an equation which expresses the condition 
that the annual charge to production, necessary to recover the 
original cost of an asset together with all repair costs, shall be a 
minimum. 2 

The present paper is an attempt to treat the problem of de¬ 
preciation from the point of view here suggested. The problem*, 
then, is to determine the life-time of an asset such that the annual 
charge to production, necessary to cover original cost and all re¬ 
pair charges, shall be a minimum. 

2 . Let us denote the original cost of the asset by C , the 
cost of repairs during the first x years by T2(z)> then the total 
outlay to l>e recovered is C + £?(z). Furthermore, let U(x) 
denote the average yearly charge to production necessary to re¬ 
cover the total outlay in x years. Then, disregarding for the 
present all interest considerations, we have 

( 1 ) U(x)= 

The notation U(x) suggests that, in general, the unit charge ta 
production will be a function of x . 

We shall now define the life-time of an asset as that value of 
^ which will render the value of U(x) in (1) a minimum. 

-J. S. Taylor, A Statistical Theory of Depreciation, Journal of the 
American Statistical Association, Vol. 18 (1923), p. 1010, is, I believe, the 
first writer who recognized in part the principle here set forth. He calls 
attention to the fact that the useful life of a machine depends both on the 
manner of distributing depredation charges and on the assumed interest rate. 
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The analytical conditions that a function of x may have a 
minimum are that the first derivative of the function with respect 
to x be zero and that the second derivative of the function with 
respect to x be positive. If, as is customary, we denote the first 
and second derivatives of U(x) by U fx) and U (x) re¬ 
spectively, and those of P(x) by P (x) and (x) respective¬ 
ly, we find on differentiating 


U'fx)- 


x z 


> 


U "(x) = 


x z J?"fx)- 2P(x)+2C_ P‘(x)-2U'fx) 


from which it is evident that the life-time of an asset must satisfy 
the two conditions 

(2) xprx)-c-j?(x)*o, P7x)>o 

In short, the life-time of an asset is given by that root of 
the equation C which will make P"(x)> O\ 

For example, let us suppose that the repair function is givenj 
by the equation Pfe) = ax 2 + + c . Then P'(x)=2qz +h, 
T?Yx)*2a> and the conditions (2) reduce to 

= C + c, a>0 

The life-time of the asset is therefore equal to J(C+c)/a 
provided the coefficient a is positive. It is interesting to observe 
that x, is independent of the constant b . 

3. In the preceding discussion no allowance was made for 
the salvage value of the asset. Let us denote the scrap-value of 
the asset after % years by S(x), then the average yearly de- 
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predation, interest again not considered, is 

(3) um- 

and the conditions which will make U(x) a minimum are 

(4) P"M>S7x). 

If the scrap-value is a constant, both S&Jand S‘'(z) 
vanish, and the life-time of the asset is determined by 

(5) xP'M-C-RM+SM = O t P "M > O 

The conditions (2) include (4) if we replace JPfx) by J?fe) 

- S(x) } that is, if in the outset we diminish the repair function by 
the salvage value at time * ; to include (S) it is suffident to re¬ 
place C , the original cost of the asset, by C-S , the difference 
between original cost and scrap value. With these modifications 
we may treat (2) as representing the general case. 

4. To avoid any possible confusion, let us denote by Tfx) 
the total outlay to 1>e recovered by uniform annual charges to 
production during the life-time of the asset. Taking account of 
the residual value S(x) we see that 


( 6 ) T Yx) = P Yx)-SYx) 

and (3) and (4) take the simpler forms 

(7) U(x) = TMM xT(x)-T(x)= 0 9 P"te)>S'H 

From the first and second of the equations (7) follows: 

( 8 ) T'fc)*U(z) 


which may be appropriately called the life-equation of an asset 
since its solution yields the life-time of the asset as defined in 1. 
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5. When the repair function and the salvage function are 
known, the real roots of the life-equation may be found either 
by direct methods or by methods of approximation. However, in 
the great majority of cases which occur in practice the value of 
T(x) is given only empirically, from the recorded experience re¬ 
lating to the asset in question, and the data available may not lend 
itself to analytical treatment. In all such cases the life-time of 
the asset may be determined approximately by means of the fol¬ 
lowing simple graphic method. 



Let AB (Fig. 1) represent the graph of the equation 
y ~ T(x,) , constructed in Cartesian coordinates. We shall call 
it the total outlay graph, because the ordinate y of any point 
(z, y) on this graph represents the total outlay during the period 
of x years if the asset were scrapped at the end of this period 
The straight line, OP, joining the origin O to any point P 
on AB , we shall call the uniform charge to production graph. 
It enables us to determine at sight the aggregate amount that must 
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be charged to production during any given period of time in order 
to recover the total outlay OP for the time ^ on the basis of 
uniform distribution over the entire period ^ . If x, is ex¬ 
pressed in years, the ordinate U V , of the point on OP whose 
abscissa is unity, will represent the uniform charge to production 
per year, which is required to recover the total outlay for # 
years. 

Now it is obvious that this unit charge UV will vary with 
the slope of the line OP . It will be least when the slope is least, 
that is to say, when the point P is such that the line OP is 
tangent to the total outlay graph. The abscissa, OC , of the point 
of contact, T , is then the life-time of the asset under considera¬ 
tion. 

To determine the life-time of an asset, interest considerations 
being disregarded, we need therefore only construct the total 
outlay graph A B , then draw the tangent O T , and finally mea¬ 
sure the abscissa of the point of contact T . 
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(Fit*. 2)shows the construction when T 60 has the forms 
10*** 25 * *• 1000, 

10 x z + 1000, and 
10 ^ - 25 ^ ^ 1000 

resj)ectively. In each case the life-time is found to be 10 years, 
which verifies the theoretical conclusion of 2: that the life-time is 
independent of the coefficient of X . 

We have seen from graphical considerations that the uniform 
charge to production will be a minimum when its graph is tangent 
to the total outlay graph. This condition is precisely the condi¬ 
tion expressed by equation (8), which asserts that when x is 
the life-time of the asset T'(x) , the slope of the tangent to the 
total outlay graph, is numerically the same as the yearly charge 
to production. 

6. We now come to consider the problem of finding the life¬ 
time of an asset when interest at a specified rate is to be taken into 
account. In this case, the various items that make up the total 
outlay, as well as the component charges to production, must be 
replaced by their present values at some arbitrarily chosen epoch, 
as say, the epoch zero. 

Let us attempt an analytical solution of the problem. LetZl t 
represent a small interval of time. The outlay during the interval 
from t to i+AP is T(P+At) -T(t). If the specified rate 
of interest is, l , and if we represent the discount factor by l/(l + i) 
the conventional symbol V , then the present value of 

T(t+At)- T(t) 

at the epoch O has some value between \/t 

and [T(t+At)- T(if\y t+M , let us say 

where S has some value between O and 1. The 
total outlay during the time t , evaluated for the epoch O , is 
therefore 

( 9 ) 


C+P\f(t+Ai)-T(t)\y 


t+6tAi 
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the sum extending over all the time intervals between 0 and t . 

Now At w here & M is the greatest 

among all the fractions &. We have, therefore, 

\rCt+At)-T(i]\S >\T(t+At)-Tft)\^ &M 

( 10 ) 

** 

If the intervals A t are all equal and their number n , then 
At = t /n , and as n is increased indefinitely At approaches 
0 Then t + Q n A t approaches t , and we see from (10) that 
(9) must have the same limit as 

(in c+ZL \r(t+A t) - 7w;] k * 

To determine this limit we write 

T(t.At)-nt). T( ****i-™) . at 

At 

where the first factor on the right represents the difference quotient 
which approaches T'Ct) as a limit as At approaches O as a 
limit. With this relation introduced into (11), we obtain for the 
present value at the epoch O of all the increments of outlay dur¬ 
ing the time t , the intervals of time being infinitesimal, 


( 12 ) 


T(t) = C * L irrnt 
At *o 


£ 


Tft+& tj- T(t) 
At 


V*At 



In a like manner we may derive an expression for D (t) , 
the limit of the sum of the present values at epoch O of all the 
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charges to production during the time t apportioned at some uni¬ 
form rate U to each of the intervals A l . The charge ap¬ 
portioned to the interval from t to i +At is U.At, its present 
value at epoch O is U. \z* 4S ^*At- The present value of the sum 
of these amounts for all the intervals A f between O and i is 

Z U.V t+e ‘ Ai At 


which for infinitesimal values of A t has the same limit as 
2L U.v t t At, so that finally 

_ r t 

(13) D(t) - Limit ZL U v f At * U att. 

At J Q 

Let U fx) be the value which must be assigned to U in 
order to recover T(x) , the total outlay for x years through a 
uniform charge to production, interest considered. Then D(x) 
must equal T fa), that is, 

Ufa) ■fv i .di=c+J v t rctj di. 


from which 


(14) 


Ufa) 


C+J v* T'Ct)-dt 

TV* di 


The life-time of the asset is that value of ^ in (14) which 
will make Ufa) a minimum. The derivative of U(x) with 
respect to x must therefore vanish. Differentiating (14) with 
respect to x and setting the result equal to 0, we find 

di\ 


*T'fa)jVdt- \c*J Tt rt i 
o L * 


from which 


C+f v*-T'(t)-dt 

r* ~+ 

f v*-dt 


(15) 


T'fa)= 
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which is the life-equation of the asset, interest considered. 

7. In deriving equation (13) we apportioned the charges 
to production for an interval A t and found the sum of the pres¬ 
ent values at epoch O . DCi) is the limiting value of this sum 
as the intervals At are indefinitely diminished. If, as is cus¬ 
tomary, no charge is made to production until the end of the 
year, this single charge will be the aggregate amount of the con¬ 
stituent portions for the separate intervals A t , accumulated with 
interest to the end of the year. The charge for the interval from 
t to t + A t is U A t , its amount at rate L to the end of the 
year is U. (1 + i)*.At , where t is the time to the end of the 
>year, and the equivalent single charge at the end of the year is 

U = hrrntLU (L+if.At = U kmtZCl+i)*At 

At + O At ->o 

( 16 ) -u/fUO* eu-ignzr 

o 

8. As an example let us again take T(t)~at +bt+C, 

then 

A t 

T'(t)=2ai+b Ji/*dt « v^/og ^\ 0 = (AD/tog K 

t "v ■ 

v*. T(i) dt - 

= [^aty^dfy £ -Jfj/log v- 2a fA-l)/(fog r) z 

( 17 ) 

T(t) = c+\2at\/%-b(\/ -/JJ /Jog ^-2a/yff)/(fog v)/ 




( 18 ) 


D(t) = U ( i /*-!)/ fog v, 
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and (15) reduces to 


(19) 


v*-/og v* = 1 + 


c Clog \/) 2 
Ta 


While the life-equation (19) cannot be solved algebraically, 
it is evident that an approximate solution for could be ob¬ 
tained from a list of tabulated values of the function \/ **- /ogv*. 
When such a table is not available, an approximate solution to any 
desired degree of accuracy may be obtained as follows: 

We may write for v t e K where e is the base of the 
natural system of logarithms. (19) then takes the form 

(20) e x/og K - z/og is = 1+c(log y) z /2a. 


On expanding the first term of this equation into a power series in 
X , and simplifying the result, we have 

x z + x V- log \/)/3+ z zJ 2 //Z + z/joq v)*/(50 + • • • - c/a, 


whence 


_ c/a __ 

~+z7!oq \s)/3 /o<g v) z //Z -t-z^/fog v') 3 /60+ • • • 

Now for all ordinary rates of interest log 1/ is necessarily very 
small, so that if we denote successive approximations of ^ by 
x n x 3 * etc., 

.,,!/* r </ a -w* 

x, = (c/a) , x = 


*3 m 


/+x,(/og y)/? 

- __i 

f* x z Cog is)/3 /og y) ‘V'/xfJ 




'/2 


r_2^_ 'V/* 

][** 9 f/og is)/ 3 + z*Oog <s) z /2 +X 3 3 )/ag is) 3 /60 J 

, etc. 
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Let us take the special case, previously considered in 5., when 
c * 1000, b = 0, £> = 10, and let the assumed rate of interest be 
6 percent. Then 

log y = -O Ofd 269, (tog //= O0OJ397, 

I/log y = -1716/ 788, J/(/og sj z ?294 926 967, 

and we find 

T(t) * /Ot z +/0O0, 

7(t) » /OOO - 343.24ty f - 9890 ?4(y t -jJ, 

DCtJ - !7 /6Z U(l- y ± J, 

and the life-equation is 


v*-/og y x * / /69 764 



The first four successive approximations for z give 

X t */0, * z = ///< = //6>f 
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The value x * II07 substituted in (14) and (16) give us 
U{x)-22IA5 and D(x)= 226.0? 

This value of U(xJ substituted for U in the expression for 
D(i) gives 

D(t)= 3800.5 (/-?*), 

which represents the present value at epoch O of the aggregate 
momentary charges during a period t at a rate such as to recover 
the total outlay 11.07 years, the theoretical life-time of the asset. 
The momentary rate is 221.45 per year, the equivalent single 
charge to production at the end of each year is 228.03. 

(Fig. 3) shows the graphs of the two equations. 

J(t) = IOOO -343 24tv*- 3890 

and — 

D(t) *?800.3 

The abscissa of the common ordinate of the two curves represents 
the life-time of the asset. 

9. It appears from (Fig. 3) that at the point common to 
the two graphs, the graphs have a common tangent as well as a 
common ordinate. To see whether or not this is a general prop¬ 
erty let us trace the changes in the total outlay and total charge 
to production functions when interest is taken into account. 

In the first place it is evident that the increments of the 
ordinates of both of the graphs in (Fig. 1) must be replaced by 
their present values at the chosen epoch. If this epoch is the 
effect in question will be to shorten progressively the ordinates of 
both graphs. The charge to production graph will then be no 
longer a straight line but some convex curve, while the total out¬ 
lay graph will go over into another graph which is less concave 
than the original graph. But both graphs will continue to rise 
indefinitely as we proceed from left to right because the incre¬ 
ments of their ordinates, while decreasing indefinitely remain 
positive. 
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In (Fig. 4) let Tft) and Dft) represent respectively the 
total outlay graph and the charge to production graph, interest 
disregarded, Tft) the total outlay graph, interest considered, 
and Dft) the charge to production graph, interest considered, 
through any [joint El on Tft). The ordinates on Dft) repre¬ 
sent the present values at epoch O of the momentary charges to 
production during time i at a rate such as to recover the entire 
outlay during the time corresponding to the abscissa of the point 
El . This rate is measured by the initial slope of D(t) , the 
slope of Dft) when t=0. 

Let us follow the changes in this slope for the various posi¬ 
tions of the point El as it moves along Tft) from left to right. 
It is evident that this slope at first decreases, also that it cannot 
keep on decreasing indefinitely, it is therefore plausible that it 
will ultimately increase, reaching a minimum value at the point 
P where the Tft) curve and the D(t) curve have a common 
tangent. The abscissa of the point of contact, P 9 is then the life¬ 
time of the asset under discussion. 
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10. The foregoing considerations, however plausible, are 
open to objections, because we have reasoned from graphs result¬ 
ing from the assumption of a special law governing the repair 
function. Different assumptions might give rise to essentially 
different graphs. We shall, therefore establish the conclusions 
above arrived at, by an analytical proof, which is independent of 
any assumptions regarding the nature of the outlay function. We 
shall prove the 

Theorem: If the rate U of a uniform charge to produc¬ 
tion curve is a minimum, this curve is tangent to the correspond¬ 
ing total outlay curve, and the abscissa* of the point of contact 
represents the life-time of the asset. Conversely, 

If a uniform charge to production curve is tangent to the 
corresponding total outlay curve, U is a minimum. 

To prove this theorem, let y-T(t) be the equation of the 

total outlay curve, y « Dft) =U f the equation of the uni- 

o 

form charge to production curve, and x the abscissa of a point 
common to the two curves. 

_ r x 

Then T(x) -DM from which 

( 21 ) U=TM/f^v t dt 

Since by hypothesis U is a minimum, its derivative with respect 
to x must vanish, that is 

(22) T?x)f 

From (22) and (21) follows 

(23) T'(z)*v*T(z)/f 

This shows that at the point common to the two curves their 
slopes are equal, they have therefore a common tangent, and since 
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U has a minimum value, must represent the life-time of the 
asset. 

To prove the converse theorem we observe that if the two 
curves have a common tangent at the point t= z , 

(24) T(x)-D 66) .(jf r*ctt 
and 

(25) T(x)*&'C*)* 

Substituting the value of U from (24) in (25) we find 

T(x) = v* T(jc)/J*v 

from which 

T(x)T(») = O. 

o 

lJut by (22) this is precisely the condition that U is a minimum. 

11. In most cases which arise in practice the analytical 
method of finding the life-equation of an asset fails owing to the 
empirical character of the outlay function. The question suggests 
itself whether a graphic method, similar to that employed in the 
simpler case treated in 7, can be devised, which will yield an ap¬ 
proximate solution of the problem. The theorems of the pre¬ 
ceding article offer the key to such a method. 

Let us suppose that the total outlay graph has been con¬ 
structed on a convenient scale, the scale depending on the magni¬ 
tude of the quantities involved. Every point on. this curve 
determines a definite uniform charge to production curve. We 
seek that particular one of these curves which is tangent to the 
total outlay graph. The abscissa of the point of contact would 
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give us the life-time of the asset, and the initial slope of the uni¬ 
form charge to production curve would give us the rate 

Instead of constructing first the total outlay curve, we may 
first prepare a sheet with rulings, as shown in (Fig. 5), each rul¬ 
ing representing the uniform charge to production curve cor¬ 
responding to a definite U , the successive values U being chosen 
at suitable intervals. We may then plot the graph of any given 
outlay function on this same sheet and from among the various 
rulings select that on which comes nearest having contact with the 
outlay graph. This will yield a first approximation of % . A 
closer approximation may then be obtained by the usual processes 
of interpolation. 





THE SIMULTANEOUS DISTRIBUTION OF MEAN 
AND STANDARD DEVIATION IN 
SMALL SAMPLES 

By ALLEN T. CRAIG 

1. Introduction. If samples of 77 items are selected at 
random from a normal universe, it is well known that the arith¬ 
metic mean % and standard deviation s computed from samples 
are independent in the probability sense and that the simultane¬ 
ous frequency distribution is 


- r i^ Z t. r? ^ Z 

F(?c,s) = e 


If, however, the parent population is other than the normal type, 
there appears to be little known regarding the form of F $). 
In the present paper, we propose to determine the simultaneous 
frequency function of the arithmetic mean and standard deviation 
in samples of small numbers of items selected at random from a 
rather arbitrary universe. For convenience, we shall classify 
frequency distributions according as the range of the independent 
variable is ^-00 oo\(O i a*) or (0, a),a>0. We shall further 
assume that the total area under the distribution function is unity. 

2. The simultaneous distribution of x, and s in samples 
of 77*2. Let f(x) 9 -&?<x<oo be the frequency ‘unction of 
the variable x, . Let x, and x z be two independent^ erved 
values ot x . write 

X, + X z = 2 £ 

xf + x z =Zs z + Zx. z 
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We seek the function F ( x, s) such that F fx, s)dxds is, 
to within infinitesimals of higher order, the probability of the 
simultaneous occurrence of x in s vafes+ds) * 

For x and s assigned, x, may have either value Z - s or 
x + s and x z is uniquely determined by x> z ~ZZ-x>j. 

F(x,s)<dxds = ffx- 5 ) ff2x-x t )dx t <dx z 

Thus 


+fCx+s)f(Zx-x l )dx l dx z . 
Since cLx ( d.x z ^Zdxds^t have 


( 1 ) F (Z,s)= 4*f(x-s)f( x+s). 

If f(x) is defined on the interval^?, wjwe note, for Z 
assigned, that 5 £ x . Thus (1) is valid for this type of frequency 
function but the surface is limited by the % -axis and the line s=Z 
If f (x) is defined on the interval (0 / a), we note, for 
X assigned on ( O j a/2), that s< Z ; and, for x assigned on 
(&/£, Cl), that s5 a.-x. Accordingly, for this kind of frequency 
function, (1) is valid but the surface is limited by the >6-axis and 
the lines s = x, ,5- < 2 - 

As simple illustrations, let us find the correlation surface for 
the mean and standard deviation of samples of two items drawn 
from distributions of various types* 

Example 1. Let 


- OO < X < GO 
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z 

cr*rr 


e 


a* 


the well known result. 

Example 2. Let 

f(x)= e 0^x<oo. 


Then 

~z5c 

F(x,5)~ 4 e 

over the open region of the 5c s -plane bounded by the x -axis and 
the line s-x. 

Example 3. Let 

0£x.iiz. 


Then 

FCZ.S)*^ , 

over the region of the 5c 5 -plane bounded by the isosceles triangle 
with sides S*O t -s = 5c and 5 - cl - X . With a uniform dis¬ 
tribution proportional to 4/a z over this triangle, it follows in¬ 
cidentally from very elementary geometry that the marginal totals 
of the distribution of 5c are given by the known values 
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and that the marginal totals for the distribution of s are given by 
^f(s)^j rz (a-Zs), 0<S<f, 

which is the result given by Rider. 1 

3. The simultaneous distribution of 5c and s in samples of 
77- J . Consider first a frequency function f(x) t -oa<x< oo ♦ 
We have 

+ x z + Xj = J X, 

X>?+ + = Js z +7x z . 

Upon eliminating we h ave 

ZX>f+2x> / X'2+£x,g'-6x?C' f - 6 xx z ~ J s z +6x z = O. 

From simple properties of this ellipse, it follows, for assigned % 
and 5 that x, may be chosen arbitrarily from the interval 
(x- S )/2, x+S^jZ ). With 7 C t assigned, x z must be selected with 
certainty as either 



Finally we must have 




1 P. R. Rider, On the distribution of ratio of mean to standard deviation 
etc., Biometrika, vol. 21 (1929) pp. 124-141, 
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r A* T-o V^v 

Ffx,s)dxds = Z / f(x l )f(x z )fU 3 )dx,dx. dx.^ 

Jx-s& J 


, 


» I 1 p -i ^ 

x _ ? g-X, *[£ 5 -■£;*] , 

2 2 

^ = 3x-x,-X 


we o1)tain 


5^0 

dz t dx z dx, * clt'dz ds 


- 9 scLx, dx ds//? 


7?=\6s*-3{x, - x)*] z 


F(x,s) = 18s f X+S f | f(x,) f( ldD2hdR )f(2dddR) d 
Jx-sj2 * z z ' 
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If ffx) is defined on the interval (O, we note, 

for 5c assigned, that 0~ s ~5cJZ. Thus the surface is limited 
by the x -axis and the line s=5c JZ. Moreover, since x tl x z , 
% 3 are non-negative, x, may be selected from the interval 
(x,-sJZ y x.+5\fz) only as long as s £ % 'IZ/Z If 

x, ZJH< s<x, JZ. 

then X/, may be selected from the intervals 
3Z - \<5s z - 3x Z ~]z 

a 5 


and 


ii , x+sJT). 


\ccordingly. for this type of frequency function, 
rX-hS'lS. 


Ffx,shlds / 4 

“4 -sjz 




3%-x*,+ 




( 2 . 1 ) 


~ 18 s 


O^Si: 


( ■2* -[d5 a - 72*1* rX+sfc 

/ Y3S*fc*-32«l* 

L y o -2- 


L f( Xl ) t(. J *- * Lt E)f(dx,, 


xJK < S< xjz . 

Z 
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If i~(x.) is defined on the interval (0,0.), we note: 

for O - x- - a/3, O-s-SL /Z ; 

for a/3*x<2a/3, OSs< z -Zax+ *f*\ Z '* 

for 2 a/3 O-s i . 

Thus in this case, the surface is limited by the x^-axis, the lines 
s * 3E JZ and s= (a-?c) J2 and the hyperbola 
5 = \2 x> z -2 ax. + Za* ^3\ Z • 

f Pig. 1.). Now x>, may be selected from the interval {x-sj2, 
x + sj2) as long as s< xJH/Z and $< (<z-x) >J2j2. . This 
holds for that part of the surface over the region bounded by OPa. 
Por that part of the surface over the region bounded by OPU, 
may he selected from the intervals 



Fig. 1 
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It is clear that the ranges of arbitrary selection of 7 C t for that part 
of the surface over the region bounded by 'PVa are 

3 jg 2 

Z 


and 

'3x-a. + \ps z -?('a-z) z \ 
Z 


i \ 


a . 


lunalh, we find that ?c t may he selected from the intervals 

{^q 2 *. z ~z* 2 -t ^ 


-Z~\Z 


3 x - a ±{_6s z -3(a-xQ z '\ z Jx-[6 s’^ -J a;' 2 ] 
,2 ’ £ 


and 


3tc+\&s z - 3**1* 
Z 


, a 


for that part of the surface over the region bounded by PUV . 
If we adopt the notation 

we have 

r2+sjz 

( 2 . 2 ) F , (x l s) = 18s ftdx,, 

Jz-sJz 


-18 s 


x-s J2 

32-Cf x+sjz 


S3x+\&$ z -3xF\p 

-z- 




r ,3x-a-[0s z -3/'a-x) z Y 


- 18 s\ 


Vx - sJZ 


“ Z — - 


(pdx. 
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T ~3 : B-a.-X£>s z -3(a-x)Z\ z 

-W * */ .* , 

Z 


j M 4 > d **> 

7 3;^|tfs 2 -. 

Z 


for the parts of the surface over the regions indicated above. 

In order to illustrate the theory, we shall consider a few ex- 
o tuples. 

Example 1. Let 

1 

f ^ x )*ajin e ’ ~ x < 00 ' 


By (2). 

. 5^ - 

se 

Example 2. Let 




f(x.) = e , OS x,< cx>. 


By (2.1), 


F(x,s)=6j?7TSe Sx , 


OSs< 






arc Stf/7 


sZjZ 


-t-arc sin 


-arc sm s2jZ 


¥*•***■ 


Example 3. Let 


fW-i . 


OS x<a. 
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By (2.2), 


Pfc,3)» - 6 & /7S ' Qver ORa 


X-V5s z ~32 J x 

arc sir? szjz — + arc sir? — 


■arc sin over OPU, 

x-a- \6s^-j(a-x)' z ~]^ 
zrc sen - fc - — - 

szlz 


\arc serf- 


iaresm &g-arcsm 
+ ^], over PVa, 
,°M«rc sin 


x x-\6s z - 

+ arc si v ^ + arc sin ~^- Zr J 


•tare s^r? 


-arc sir? (a -%) *3 z 

5ZjZ 


-zlZ 


x+\&s z - 3x z 


arc si n s Zfz ,over PVU. 
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I have succeeded in obtaining the marginal totals for S 
from O to aj2/4 by integrating F(v, s) with respect to x 
from the boundary (Fig- 1) to s- (cl-x)JS and ob¬ 

tain as a result the parabola which is known 1 to give the distribu¬ 
tion of s from s~0 to s * a>0/<5. 

4. The simultaneous distribution of x and s in samples of 
77- 4 . We shall consider first samples of four items drawn from 
a universe characterized by a law of frequency f(x) t 
Then 

x, z +x* +x z +x%* 4s z +42 z m 
The elimination of yields 

-4xx / -42x^4xx 3 -2s z +6x z -O 

It follows from the properties of this ellipsoid that x, may be. 
chosen arbitrarily from the interval (x-sjj, 2+s J3) . For X f 
assigned, the region of arbitrary selection of x z Is determined by 
the properties of the ellipse and is ^ 

(4x-x,-Z [6s i -Z(x,-x)’f :,-xZ\6s z -Zf^ r xfY \ 

1 5 5 j 

Upon solving for x 3 in terms of x>, and x z we have 

4-x-x.'-x z ±%3s z -8% 2 +3xx, +8xx & -Zx,x z ^ 
x 3 , - 

while x A is uniquely determined by * 4x~x, -x z -x 3 . 

If we write 

Ts[<3a 2 - 8Xx, +8*x z -2x,x z x, A -2 


and 


$s f('x,)f/'x,)f( 






1 H. L. Rietz [Paper to appear presently in Biometrika]. 
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then 


( 3 ) ^ 4-x-x, +2\$6 z -'2fe l -x) Z ~\ Z j 

F(x,s)= 32 s j f 3 ± T 

4 -s/3 J4Z-x,-2[6$2-2(z r *) Z Y 


§ dx z . 


The integration can be carried out in an obvious manner when 
f(x) is the normal frequency function. 

In case f(x) is defined on the interval (O, oo), we note, 
for x assigned, that s< x • Thus the surface is limited by 
the x -axis and the line $ - xfj Moreover, x, may be 
selected from the interval x+s/3) with x z chosen 

as above only as long as 5i If x/3/3^ s < x, 

then x f may be chosen from either of the two intervals 

[o \ 

' and 

± 1 +1 P y ffg . g x + 

with x z chosen as above; or x t may be selected from the interval 


with x taken from either 


^ 4Z--x i -\3s z -8x z -3x?+8x,x']* 
°> Z 


KJl j 

(4-X4x~X ' l \ 

V “ 2 5 )■ 

when z<sf nfl we may have 

O^X>,i=Zx -\zs*-2 * *] * 
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and 


r p si i 42+2Sj5$ z - 2 x 2 ~\ z 

2x+\2S z -2x z \ z * x f < 3 


with either 


0<x z £ 4 ' x ~ x ’ - dx z -Jx z ^8xx ,] f 


or 


2\1 


4x~X t A@s z -8x z -3x z +8xxf , , 4x-x t +2\63 - 2(x, - 2)*] 

----— - S:X^~ - - — “ "' —‘ 

^ * 3 


Or we may have 


< X/ , Z +S J3 


w ith 


^) z i i 


4%~x / -2\bs ;s -2{x / -xJ zm ] z 4-x-x f +ZV>s z -Z(x -x) 2r ] 
- 5-- - b —^- L - 


Xccordingly, for this kind of frequency function, t 
rX> +$J3 p x-xl-x, +Z [0S z - Z fa ,* ^x) z 2 z i 

FM-3Zs / ? - 

Jjc-sj 3 I ±lzZhi£&£-FT\ 

( 3 . 1 ) 5 


Zl¥ 

0£3<^- > 




~32 d 

L <? 

^x+s{3 

+ 


4*x. -P \,6s z -2x z YfZx-x, +2\6s^-26e, -x) 2 ] ^ 

1-4-2 -x,-2fes z -2(x, -2)*] z 

* x 

4x-x.+Z\6a*-Z(x,-2) z '\ Z 


,/ J~ 

J4x+2\6s z -2x zm \ z J^x-x, 


2\6s z -2fx, -x) z ] z 
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4x+ztes?-2Z z ]* 42-X, ^8s Z -8x Z -3x z +axx^ z 

/ 7 / 3 


4‘X'+2\£>s z -Zx : *\ S ‘ 42-x, +Zfes ^^ 

y / 

42 -x y +\~ 8 s z -< 3 #*- 3 ^ + 3 x* 2 * 

" 5 2 

* S <*. 


Udx z dX t> 


r 2£-tes*-2Z*lz , 4x-x, -\8 s z ~8x z - 7 x,*+8xx^ ^ 


^£* 3 * r ^£^l£ Qg^^/y, * 

/ 

•^2 -x, +[5s *-8x z -3x, z -t-8x.x / Y' 

Z 

4^_±2\6s^£xf\i , fg^g, jgsf-g£j^f*gg*J^ 




4 j£-z:,-fZ[ 6 s £ -JSfc -Zj *2 * 


Jt£^£fi£pl^**dL 


X-t-sJ? 


^ 4 -x - x, +2 \pS Z -2 fe, - 2 ) ] 


+ J ^ 

•A 2+z\Os z -2x z \ z J 4x-x,--Z \ps e -2rx,-x) z ~f- 


^dx A dx r 


X<s<x0 
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By similar reasoning, the writer has determined F(x t s) 
for 77 = 4* and ffe) defined on the interval CO, a). 
The results, however, are quite lengthy and formal and will not 
be presented here. 

< OUULsl "7“ 



THE LIMITS OF A MEASURE OF SKEWNESS 

By Haroid lion uint. and Itonnud M Soiomoxs Columbia University 


The measure of skewness 

mean — median 
standard deviation 

is sometimes recommended because of its simplicity. Obviously 
neither this nor any other statistic can be of much value until 
something at least is known of its distribution in samples from 
populations of some plausible form. For populations near the 
normal form the inefficiency of the median as a statistic of location 
suggests that the standard error of s may be considerably greater 
than that of ^ 3 / a 3 > We know of no investigation of the-sampling 
distribution of 5 . Apparently even the range is unknown. The 
object of the present note is to show that s necessarily lies be¬ 
tween -landl. 

The proof consists of three successive transformations of the 
sample, each increasing s , which nevertheless in the end remains 
le^s than unity. 

1 . Without loss of generality let us suppose that the median 
is zero and that the mean £ is positive. Taking 

CD — = , 

77 77 

77 being the number of observations, which we suppose odd, we 
have 

s = x/cr. 

If a negathe observation - o he replaced by zero, the mean 
is increased by a/77 . In the second of the expressions above for 
Cff the mean of the squares is diminished by a 2 /rj , while on ac¬ 
count of the change in the mean, a further subtraction is made. 
Thus o' diminishes. Hence 3 increases if we alter the distribu- 
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Lion b} ALpUu'iiK, ail tlic negati\e observations by zero. The median 
remains Lmclnu«>od at zero. 

2 . i^cl us iurther transform this altered distribution by re¬ 
placing all the positive obseivations by the mean of these positive 
quantities The general mean is left unchanged by this transforma¬ 
tion, but the standard deviation is diminished. For, denoting by 

21 the deviation of a podthe observation from the general mean, 

22 z 2 is, tor a fixed value of £ z t a minimum when all theji/s 
are equal. 

3. Tint', the \alue of s is increased when we replace all the 
ncgati e obseivations by die median value 0 and all the positive 
ob>eivations b\ a fixed quantit>, which we may take as unity. Let 
thete be fo O sand k l‘s in this distribution. Then hbk-n. 
2d on o\u, Mine the median is at O y £?>k. The mean is k/u * while 

72 <J Z = h-O 2 * k l z -k 2 /rj = hk/rz 

Hence s = (k/h)z 

in this ca*e in which s is a maximum. Since as just remarked, 
/7 >/r . this is ab\a>s less than unity, approaching unity when the 
obsei\dtiu*i'» are divided as nearl) r as is possible equally between the 

t v\ o \silrt\s. 

To go fin ther in a stud}' of the sampling distribution of s is 
possible oid} 011 the basis of special assumptions. The same is true 
of the sonuwhat more familiar but less definite measure of skew¬ 
ness. 

mean — mode 
standard deviation 

li is» d-ai lb .1 tlvie is no limit of the range of this last quantity. 




THE THEORY OF PROBABILITY FROM THE POINT 
OF VIEW OF ADMISSIBLE NUMBERS 

By Arthur H. Copeland 

I. INTRODUCTION 

The definition of the word probability has never been agreed 
upon. Before we decide on a definition, let us first consider what 
use we hope to make of the theory of probability. It is reasonable 
to demand of this theory that we shall be able to apply it. and that, 
by means of it. we shall be able to make predictions. 

If we say that the probability is .9 that a given event will oc¬ 
cur under certain circumstances, then are we making some pre¬ 
diction about the success (i.e. occurrence) of the event? Let us 
suppose that the circumstances are presented. We may observe 
that the event succeeds or we may observe that it fails. Which¬ 
ever the case may be, the result of the experiment cannot lie inter¬ 
preted in terms of the number, .9. This is always the case. We 
can never interpret the result of a single trial of an event in terms 
of the probability of that event. 

Next let us assume that n trials are made of an event whose 
probability is .9, and that, as a result of this experiment, r suc¬ 
cesses and 77 -r failures are obtained. If r? is large, we should 
expect the ratio, r/n, to be approximately .9, that is, approximate¬ 
ly nine-tenths of the trials to be successful. We shall call the num- 
lier, r/n, the success ratio. 

We have not even now obtained a satisfactory interpretation 
for the number, .9, We have not specified any limit to the dis¬ 
crepancy between the numbers, r/7?and .9, and we have not 
specified the magnitude of n . Thus, if r/n differs from .9 hy a 
small amount, it also differs from .899 by a small amount. Are we 
to be satisfied with the statement that the event in question has a 
multiplicity of probabilities including the numbers, .9 and .899? 

We can make the above statement more exact as follows: 
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Given any positive number, £, we can find a number, 77, such that 
the discrepancy between r/n and .9 is less than £. After the 
number, £, has been chosen, it is at least conceivable that a suffi¬ 
cient number of trials can be made so that will differ from .9 
by less than<£. If we make this interpretation of the probability, 
.9, and if we wish to make the statement that .9 is the probability 
of the event, then we are assuming that .9 is the only number that 
has this property. We are therefore assuming that the ratio, r/7?, 
approaches .9 as 77 becomes infinite. 

So far as I know, no one has ever given an alternative concept 
of probability which is capable of being interpreted in terms of the 
result either of a single trial or of a sequence of trials. Unless 
and until such a concept is given, we are compelled to assume that 
probability is the limit of the success ratio, if we wish to include 
an emperical interpretation. Since this paper is being presented 
to a group of statisticians, I think it will lie safe to assume that 
we are agreed that probability is concerned with the results of trials 
of events. 

It may he that we arrived at the probability, .9, by means of 
the following reasoning. There are 9 possible ways in which the 
event can succeed and 1 in which it can fail. All 10 possibilities 
arc equally likely and mutually exclusive. 

When we make a trial of the event, one and only one of the 
possibilities succeeds. The words, equally likely, have no inter¬ 
pretation in terms of the result of a single trial The reader will 
have little difficulty in continuing the analysis of these words in a 
manner similar to that of the concept of probability. In fact the 
concept, being equally likely, is identical with the concept, having 
the same probability. We shall, therefore, reject the concept of 
equal likelihood as a basis for a definition of probability. 

There is one other objection to this method of finding the 
probability of an event. Namely, there is good reason to believe 
that it never gives the correct result. In making this statement we 
are assuming, of course, that probability is defined as the limit of 
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the success ratio. In order that the 10 possibilities may be equally 
likely, it is necessary that there be perfect symmetry between these 
possibilities We cannot, therefore, have any mark to distinguish 
the one unfavorable possibility from the other nine favorable 
possibilities. Experiment indicates that such distinguishing marks 
are sufficient to make noticeable differences in the probabilities. 
For example, the dots on the faces of a die cause differences in the 
frequencies with which the respective faces turn up. 

In spite of these objections, the above method of finding the 
probability of an event, gives very good approximations in most of 
the cases where it is applied. There is no method which gives exact 
values for probabilities. It seems wise not to reject this method, 
but rather to discard any illusions which we may have concerning 
the exactness of its results. 

We have seen that we must assume the probability of an event 
to be the limit of the success ratio, if we are agreed that probability 
is concerned with the results of trials. Let us express this assump¬ 
tion in terms of the Cauchv criterion for the existence of a limit. 

Given a positive number, C> there exists a number, N % such 
that I r/r? - r/ri I < € whenever 77 £/Vand 77' where r is 
the number of successes in r? trials and r‘ is the numljer of suc¬ 
cesses in r) f trials. Physical experiment seems to indicate that 
this condition is satisfied. Furthermore, if we reject this assump¬ 
tion we deny the possibility of experimental verification of prob¬ 
abilities. On the other hand, it can be proved that the number, 
N , can newer be known. This situation is unsatisfactory for a 
mathematical theory. 

To avoid this difficulty we shall construct an imaginary 
idealized universe in much the same manner as is done in the case 
of geometn. This universe will contain sequences of successes 
and failures which are formed in accordance with mathematical 
laws. These sequences will satisfy the fundamental assumpti >ns 
of probability and hence will be infinite. We make the assump¬ 
tion that the physical universe is an approximation to this idealized 
universe. 
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II. THE ALGEBRA OF EVENTS 
We shall show how the elements of the theory of probability 
can be treated from the point of view which we have described. 
Consider first the following physical example. A coin is flipped 
ten times and the event in question is the occurrence of a head. 
The following is a record of the successes and failures, 

/, 2 , 0 , 2 , O t O t QIQ O 

where the I s stand for successes and the 0’s for failures. The 
ratio, 4/lO , of the number of successes to die number of trials, 
is obtained by adding all of the ten numbers and dividing by ten. 

If we had made a much larger numl>er of trials of the event, we 
should expect that the corresponding success ratio would have 
been much closer to the probability, one-half. 

The above sequence of is and 0s can be interpreted as a 
number written in the binary scale. Let us write 
. 1JO j 100,010,0 

This number has the value, 1// +l/4 +0/3 +l/f6 +0/32+0/64+0/!28 
+1/256+0/31Z+0//024*209/276 . We should not, however, 
think of this number as ending with the tenth digit. In fact we 
could compute as many more of the digits as we desired by con¬ 
tinuing the experiment. The computation of the values of these 
numbers will not be important for our purposes. The above com¬ 
putation was inserted merely to aid in the understanding of the 
notation which we shall describe. 

We shall now consider the construction of our idealized uni¬ 
verse. The sequence of successes and failures of a given imaginary 
event can be represented by a number, % = rz) x (3) • * xf*-* 
written in the binary scale., the kth digit, of z being 1 wcO 
according as the event succeeds or fails on the kih trial. We shall 
denote the success ratio for the first 77 trials of this event by p/x). 
Then 


( 1 ) 


p X M /r? 

77 
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We shall denote the probability of the event, jo , by pQc) and we 
shall define p(z) by means of the equation 

( 2 ) ptej= li m p n Cz). 

7 ? •* 00 

We are, of course, assuming that this limit exists. 

Most of the important questions in the theory of probability 
involve relations between different events. We shall therefore 
construct an algebra which is especially adapted to the discussion 
of related events. If z, = . and y = .y C/) y r2) y (3) " • are any 

two events, then the event, % and y , will be denoted by x ■ y . 
We have the equation, 

( 3 ) * y-.(x c0 -y 0> ), (**>'**>). • • • • 

The first digit of x- y is 1 if and only if the first digits of x and 
y are both 1 . That is, the event, x y, succeeds on the first 
trial if and only if x and y both- succeed on the first trial. 
Similarly for the second and third trials etc. The expressions in¬ 
side the parentheses are understood to be ordinary algebraic prod¬ 
ucts. The expression, x -y , is a symbolic product. 

The event, ^ or y or both, is denoted by x^y. We have 
the equation 

(4) x^y^(xr^y-x *y ),(* +y -x y )>"' 

It will be observed that the first digit, (z f/} *y ( '-X e ' 3 y (0 Jol z*y 
is7 if or if *‘' 4 /, y c, lO or if x ( '^O t y co =l , but that 

this digit is 0 if x (O =y c -0. Thus the event, x^y, succeeds on the 
first trial if x succeeds on its first trial or y succeeds on its first 

trial or both x and y succeed on their first trials. Similarly 
for the second and third trials etc. 
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We shall use the symbol, -v x., to denote the event, not x , 
It is easily seen that n/ x, is given by the equation, 

(5) fl/x (t-X' U) ), (!’x l3) ), ■ • • . 

Let us denote the event, y if x, . byyc * * Before attempt¬ 
ing to give a formula for ycx let us first consider the expres¬ 
sion, 77 rp m (x). This expression is equal to the numlier of successes 
of the event, X , in its first m trials. Thus if is the number 
of the trial on which the nth success of x, occurs, then 

( 6 ) 

We can write 

( 7 ) ycx * . y ^ ?77/ ^y y ( rr) 3 ^» * • , 

Thus we consider those trials of y for which the event, x , oc¬ 
curs. In other words we consider a given trial of y if (and only 
if) # occurs on that trial. Hence equation (7) gives us the cor¬ 
rect expression for the event, y if x * 


[*The operators, f , v and 'v are also used in symbolic logic with similar 
interpretations. See Whitehead and Russell, Principia Mathematica, vol. 1. 
The symbol, c , is an inverted implication sign. The expression, ycx, could 
he read, y is implied by x , or, y if X . For the benefit of those who are 
familiar with Principia Mathematics, it may be added that the symbols,#, 
y , etc. are propositional functions rather than propositions. Each X is as¬ 
sociated with a sequence of events, and each is a propositional function of the 
form, the kth event will succeed, k being a free variable. The probabil¬ 
ity is a property of the set of propositions rather than of any given proposi- 
tioa Thus we should speak of the probability of a propositional function 
rather than of the probability of a proposition.] 
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PROBLEMS 

In problems, 1 to 3, assume that x and y have the following 
values 

*« 110,100, 010, Oil. 101, OOO, 10 ■ 

y = . 110, 11 1, Oil. OOO, 100 010,11 

1. Compute p tf (x) and p zo (y) . 

2. Compute the first 20 digits of (a) x y , (b) x vy, (c) ai x , 
(d)ywx; . 

3. Compute as many digits as possible of y c x and xcy. 

4. Prove the following identities: 

(a ) x y ~y x (g) /v si/ x * x 

(b) x (y z)=-(x y) z (h) /vfx y; = /vx^A/y 

(c) >svy = yv% (i) ru(x vy) = AJX • /vy 

(d) ^ v(y ) S (X v y) V2 (j) X'SrJX^l 

(c) x (y^z)=(x y)v(x z) (k) (x y)v(x ruy)~x 
(f) xv(y z) =(xvy) {xvz) 

5. Trove that p n fx v y) = p n (x) +p„ Cy) -p„ lx y) 

6 Prove that p (x vy)= pfx) +pfyj -pfx y) 

7. Prove that p(njx) = 1-pCx) 

8. Prove that p[y /vx] = pfy)-pfx y) 

9 Prove that if % ru(y ^^=1/ then x-(x y)v(x z)v(x w) 
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ril. THE COMPUTATION OF PROBABILITIES 

We shall say that two events, x and y , are mutually ex¬ 
clusive provided x fails whenever y occurs and y fails when¬ 
ever x occurs. It is easily seen that x and y are mutually 
exclusive if and only if x m y*0. It follows from problem (6) 
that 

(8) p(x,vy) • pfe)+p(y) if xy*0. 

If we have three events. x , y , and z , which are mutually ex¬ 
clusive. then x, y z = Z’ x = O . Hence 

pfa^y)* p(z) * pfe)+p(y)+&(*)- 

We have the following theorem. 

Theorem 1. It the events, x 2i x^, . . . , are mutually 

exclusive then 

p(xyx 2 ^- •* ^x n ) = pfej+pfx^+pfcz)* • • • +pfo n ). 

Suppose we have a set of events, x, , x z , • * • x^ , such that 
at least one of the events must occur. Then 
Suppose further that these events are mutually exclusive and that 
their probabilities are equal. Then pfa,)+p(xj+ * * ' + pfe n ) « 1 
and therefore p(x,)*pfe z )=p(x n ) * l/n . This principle is 
very useful in the computation of probabilities. 

Example 1. From a pack of 52 cards 1 card is drawn. What 
is the probability that this card is the ace of spades? It is reason¬ 
able to assume that the probability of drawing any one of the 52 
cards, is the same as that of drawing any other card. Thus we 
have 52 events which have the same probabilities. Moreover these 
events are mutually exclusive and it is a certainty that at least 
one of the events will occur. Hence the desired probability is l/?Z 
Example 2. From a pack of 52 cards, 13 cards are drawn. What 
is the probability that these cards are all spades? We assume that 
any combination of 13 cards has the same probability as any other 
combination of 13 cards. Since there are such combina¬ 

tions, the probability is l/ fz C^ = 1/677, 0/3, 779,600. 

We shall now compute the probability of the event, y c X . 
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We have the equations, 


... £ y 

( () ) P n (ycx,) = !*L— 
n 77 


rri; 


m. ri 

L *- 
«*/ 


77 ykyk 


77 


mr7 




where rn^ ^ = n. 

If we allow 77 to become infinite we get 

HO) p(ycx)* p(xy)/p(x). 


Multiplying both sides of equation (10) by p(x) we get 

( 11 ) pfc)'p(yc.z)* pfy-x). 

Kxample 3. A pack of 52 cards is divided into 4 piles of 13 
cards each. One pile contains just 1 heart and the other 3 piles 
contain 4 hearts each. A pile is selected at random and a card 
is drawn from this pile. What is the probability that the pile 
selected will be the one containing just the one heart and that the 
card selected from this pile will be the heart? Let y represent 
the drawing of a heart and x represent the drawing of the pile 
containing just one heart. Then p(z) *1/4 and pCyex)*l/lJ. 
Hence p(y X)=pfy)'P{ycx);$Z This is the desired probability. 

We shall say that an event, y , is independent on an event, 
X , provided the probability that y will occur is the same 
whether X occurs or not. If we express this condition for inde¬ 
pendence in terms of our symbols we will get 

(12) pfycx) = p(ycruz). 

Hence 

pcy- z) _ = p(y)-p(y-*) 

03) pcx) ~ pfojx) ~ l-p(x) 


Therefore 

(14) pfx-y) - p(z)-p(y) 

It is a simple matter to reverse our steps and start with equa¬ 
tion (14) and obtain equation (12). Moreover, from the sym¬ 
metry of equation (14) it is easily seen that if y is independent 
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of * then ^ is independent of y . We have now proved the 
following theorem. 

Theorem 2. A necessary and sufficient condition that two 
events, x and y , be independent, is that p(x-y) = pMpfy )• 
Example 4. A coin and a die are thrown together. What is the 
probability that the coin will turn up a head and the die will turn 
Up a 3 ? Let % represent the occurrence of a head and y repre¬ 
sent the occurrence of a 3. Then p{x)*l/Z and p(y) - 1/6 . Since 
the events are independent it follows that p(x y) - J/1Z 

It should l>e observed that equation (11) is always true but 
that equation (14) can only l>e used when the two events are in¬ 
dependent. The term, contingent, is used to apply to events which 
are not independent. If x and y are two contingent events 
we must use equation (11) to compute pfe-y). 

In order that three events, x . y , z , may be independent, 
it is necessary and sufficient that p(x y)*pfx)p(y) t 
p(y-z) » p(y)p(z) t p(z x)~p(z)p(x), p(x-y z) 

= p(x)p(yz) = p(y)p(z x) = p(z)pfzy) % 

This definition is easily generalized to the case of 77 events. 

It is generally assumed that the trials of an event are inde¬ 
pendent. What does this assumption mean ? Suppose, for ex¬ 
ample. that we wish to say that the first trial of an event is inde¬ 
pendent of the second. The first trial constitutes an event. X, , 
and the second trial constitutes an event, x z , hut we have only 
defined one trial of X, and one trial of x z . Independence is 
defined in terms of probabilities, and probabilities can he given 
meaning only in terms of sequences of trials. 

We can get around the difficulty in the following manner. 
Suppose wc wish to consider the independence of 77 trials of an 
event, x . We will consider n events, x f , x z , >& 3 , . . *x n . The 
first trial of x f will be the first trial of x . the first trial of X z 
will be the second trial of x , the first trial of x 3 will be the third 
trial of^: , etc. The 2nd trial of x ( will be the (r?+f)st trial of 
x , the 2nd trial of x z will be the (rf+2)nd trial of x , etc. In 
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general the digits of the number, x r , are selected from the digits 
of the numl>er, x . The digits selected are, the rth , the (r+n)th > 
the (r+2n)th,(r+}ri)ih* etc. That is 

05 ) X r *. ^ x (r*r,) z (r4Zn) jc (r^3^ . 


We can now speak of the independence of the numliers, s' 
• • •. 

It will be observed that 


06 ) 


J1L 

bZ‘ n 


Z' r +2' 


\Z-r-Zn + Z -r-3r7 ^ 




and hence we can write 


( 17 ) 


X r = X C 


2 r 
1-2 '" 


We shall abbreviate this notation still further and write 


(IK) (r/n)#, xc. - . 

1-2'” 

It is natural to assume that p\J r /'n)x] - p(x) for every pair of 
numbers, r and rj , such that 0<rin. If we assume this, and 
if we assume that the numbers, (l/n)x, (Z/n)x. 0/r>)x . . Jn/n)x % 
are independent, then x must satisfy the following equations. 


1 iy ) p[(r,/n))c (r z /rj)x ■ ■(r k /n)x]*[p(x)} l< 


for every r? and for every set of integers, r,. . . r k , 
such that O < r, < r z < * •. ^ r k ^ n. 

Any number, x , which satisfies equations (19) is called an 
admissible number. It can be proved that there exist admissible 
numbers.* It is clear that an admissible number, x . characterizes 
the l>ehavior which we should expect from a sequence of trials of an 
event with probability, p(x ). 


[*See the author’s article, Admissible numbers in the theory of probability, 
American Journal of Mathematics, Vol. L, No. 4, Oct. 1929]. 
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Example 5. An event, x, , has the probability, p(x ). What is the 
probability of obtaining precisely two successes in three trials of 
the event? It is required to find 

p{[W)x-(Z/3)z • njO/VzNU/DxWJx-vO/V* J 

Each of the square brackets contains three independent numbers. 
Thus for each square bracket we have the probability, f p6c)fp(njx)* 
The square brackets themselves constitute three mutually exclusive 
events, flence the desired probability is j[p(z)] p(v%). 

Let us find the probability of r successes and n-r failures 
in 77 trials of an event. Let pfe)*pim<\ p(^&)=q . The proba¬ 
bility that a given set of r trials will all be successful, is p r , and 
the probability that the remaining n-r trials will all be failures, is 
q 37m/m The r successful trials can be chosen in n C r ways. 
Since all of these ways are mutually exclusive, the desired proba- 
bility \* n C r p r q n ' r . 

Consider the following problem. Let x . t , z z , . . . % v be a 
set of mutually exclusive events whose probabilities are known. 
\Ye shall call these events causes. Let y be an event which can 
occur only as a result of one of the causes. The probabilities of 
y if x>, , y if x, z . etc. are also known. An ’experiment is per¬ 
formed and it is observed that y occurs. What is the prol&bility 
that this occurrence is a result of kih cause? The answer to this 
question is given by the following theorem. 

Theorem 3. If is a set of mutually ex¬ 

clusive events, and it y is such that y~v(X, >/ z z v ' ■ * v * n ) - O , 
then • 

pfc#)' p(ycjc. k ) 

pfc k czy)= -g—;—:—7-r ‘ 

ZL pfri)'p(yc.xi) 

L*l 

Since follows that 

y=6 rx t ) sy (y x z )''" * ^(yx n ). 

Hence o(y) =p(rx t ) + p(y x z )+ ••• + pfy x n ). 
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Therefore 

p(y)-p(x,) pfyex.,)* pfx z ) pfyoc z )+ • • • + pfa„)p(yoc n . 
To complete the proof of the theorem it is only necessary to sub¬ 
stitute this value of pfy) in the equation, p(x^cy)-pfe k y)/p(yX 

and then substitute pfc k ) p(ycx. k ) for pbc k y). 

Theorem 3 is known as Hayes* principle. The probabilities, 
P(*i)*pfoz)> • • • pfc n )' are called a priori probabilities, 
whereas the probabilities, p(x t cy) % pfr 2 cy), . . . p(x n cy) % are 
called a post ion . 

Example 6. There are four urns. % U f , U 2 , U 3 * The urn, (J 0 , 
contains three black balls, £/, contains one white ball and two black 
balls, U z contains two white and one black, and U 3 contains three 
white balls. An urn is selected at random and a ball is drawn from 
it and found to l>e white. What is the probability that the ball 
came from U z ? Let % represent respectively the 

drawing of U 0 . U, , U z . U s . and let y represent the drawing of a 
white ball from the urn selected. Then 

p(x a ) = pfe,) = p/x z ) = pfz 3 ) = //4, 


and p(y<zx a )= O , pfycx, )=//?. p(ycx z )~2/Jp(ycx, 3 ) = J/J. 

J-.3. 

1 fence p( xcy) - TT~iz~7 5 ' = >/3. 

Example 7. Two people, A and H, make the same statement in- 
de|)cndently. Let tliis event l>e denoted by y . Let % denote the 
event that the statement is true. Then y can be the result of two 
causes, x, = x and It is given that the probabilities of A 

and n speaking the truth, are respectively a and b. What is the 
a postiori probability that the statement is true? We know that 


p(ycx,) = ab and pfy<zx z )-(l-a)(l~b) . Hence 

, , a.b'pte) 

pfxcy) --■- 

a- b * pfy) + (1-a •p(njx) 
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ft might l>o added by way of warning that it is easy to state a 
problem of this kind* which is without meaning. 

Let us consider the problem of finding the probability that an 
event* %. , will preceed an event* y . a tie being excluded. We 
have the four possible situations. #*y X'fay)s('vx) t y. 
The first situation represents the tie. and this situation we have ex¬ 
cluded. in the second situation neither x nor y succeeds, and 
this situation should also he excluded. The event*# , will preceed 
y provided X succeeds and y fails if either of the last two 
situations occurs. Hence the desired probability is 

Pffc‘™yJcU#*'vy)v(y.,vx) }]» - -p ---;-? - 

^ p(X'rvy) + p(y rux) 

When x and y are mutually exclusive this last expression takes 
the following form: 

pfr) 

p(x) + p(y) 

IV. CONCLUSION 

The alx>ve examples illustrate how the theory of probability 
can be developed in terms of our idealized universe. By this 
method we can construct a consistent mathematical theory, and one 
which admits the possibility of experimental verification. 



RELATIVE RESIDUALS CONSIDERED AS 
WEIGHTED SIMPLE RESIDUALS IN THE 
APPLICATION OF THE METHOD OF 
LEAST SQUARES 

Walter A. Hendricks 
Junior Biologist, Bureau of Animal Industry 
United States Department of Agriculture 

In a recent paper the writer 1 discussed some considerations in¬ 
volved in fitting a curve, by the method of least squares, to data 
in which the magnitude of the errors of measurement was affected 
by the size bf the dependent variable. For the special case in 
which the percentage errors of measurement were distributed nor¬ 
mally, it was shown that the most probable values of the depend¬ 
ent variable could be calculated by minimizing the sum of the 

y 

squares of residuals of the type, V ~ too , with respect to 

V , V being the arithmetic mean of the ratios of the observed 
values of the dependent variable to the corresponding calculated 
values and equal to unity at that minimum. 

The concept of a relative residual has a certain value to the 
investigator as an aid in visualizing the nature of such a set of 
data. However, it is possible to use a different method of analy¬ 
sis, based on the theory of weighting, which will yield exactly the 
same results when applied to such a set of data and in addition 
possesses the advantage of being applicable to more general prob¬ 
lems in which the relation of the errors of measurement to the 
values of the dependent variable is more complex. 

Standard texts on the method of least squares such as that 
by Merriman, 2 show that if the probability of the occurrence of 
an error of a given magnitude varies for measurements of suc¬ 
cessive values of the dependent variable, it is necessary to weight 
the observation equations when fitting the curve. If the errors of 

1 Hendricks, Walter A. 1931. The use of the relative residual in the appli¬ 
cation of the method of least squares. Annals of Mathematical Statistics , 
2 (4): 458-478. 

2 Merriman, Mansfield. 1907. The method of least squares. 230 p.. illus. 
John Wiley & Sons, New York. 
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measurement, made In obtaining one observed value of each of 
several successive values of a dependent variable, f(X) , are in¬ 
fluenced by the magnitude of, i(X) , the probability of the occur¬ 
rence of errors of the magnitudes, x.,, x x ,x. 3 , . , 

respectively, is given by the following equations: 

P z - k z e Z z 




P 3 ak ? e 

h z x 2 
**• *”*” 


( 1 ) 


The probability of the occurrence of the given system of “*‘ors 
is given by the product: 

p‘=k'e '*••**•*+ h *** *** + ' ' ’* n * (2) 

in which P'= P ,. P x )% and k '* k t .k x k 3 .--k fr 

If the exponent of e in equation (2) is divided by a con¬ 
stant measure of precision, h 2 , the equation may be written in 
the form: t . 

P'= k'e (p '*** p *P» x n ) ^ 


in which h* - p,h* , etc., and p,, p z , p 3 ,- ■ p„ are the 

weights of the corresponding errors. P' will have its maximum 
value when the value of the expression, p t p z z 2 +p 3 x 2 + -P n x ^. 
is a minimum. 

Applying the above principles to curve fitting and substituting 
a residual, v , for every error, p , to distinguish the residuals 
from the true errors, it is evident that the constants of a fitted 
equation must be determined in such a manner that the value of 




the expression, A V, 2 + P 2 v 2 + p 3 v** • • - Pjj v n * , is a 
minimum.* 

If the equation to be fitted is of the type used in the writer’s 
previous study (loc. cit.), viz.: 

Y=AX Z .(4) 


this condition is obviously satisfied by the solution of the follow¬ 
ing equation: 


dv, 

P^Ta^z^Ta 


OVL dv. 

1/ f ^ p y - 

5 dA 


'Py, v ~ 


n d'A 




Let Y \ represent any observed value of the dependent vari- 
2 

able and let AX ( represent the corresponding most probable 
value. Then it is evident that: 

Vi = AX?-Y t .(6) 


and 



( 7 ) 


All that remains is to find the weight, p£ . 

Equations (2) and (3) show that the weights of the errors 
are proportional to the respective measures of precision, h? t h* , 
h? — h 2 . It follows from the well-known relation between 
the measures of precision and variance that any measure of pre¬ 
cision, h t 2 , is equal to Yal 2 which &£ is the standard error 
of the observed value, Y L , of the dependent variable. There¬ 
for, any weight, p L , is given by the equation: 



( 8 ) 


"•The above development follows that given by Merriman with a slight 
change in notation. 
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tn which fr z is the constant measure of precision in equation (3). 


If a set of data were obtained by making one measurement 
of each of several successive values of the dependent variable. 
AX 2 and the resulting percentage errors of measurement were 
distributed normally, it follows that the coefficient of variation of 
a number of replicate measurements made at any value of AX 
would be equal to that obtained for every other value of AX' 2 * 
In other words, the standard error of every measurement would 
be directly proportional to the value of AX 2 measured. 

Since, by equation (S), the weight of any error of measure¬ 
ment is inversely proportional to the square of its standard error, 
it follows from the above discussion that this weight must also 
be inversely proportional to the square of the value of AX* 
measured. Combining all factors of ]proportionality into a com¬ 
posite constant, C . the relation between any weight, p L , and 
the corresponding value of the dependent variable, AX L 2 , may 
be expressed by the equation: 








If the substitutions suggested by equations (6), (7), and (9) 
are made in equation (5), this equation may be written: 

cKAXf-YjiX* CteX*-Y z )X* M C(AX*-YJX] x 
^ A*X* * A 2 X 4 




C(AX*-YJXA 

4*X* 


-o 


( 10 ) 


Since the constant, C , is common to every term in equation 
110), it may be removed by division. The equation may then be" 
reduced to the form: 


v 1 


i AX*-Y)X* 


= 0 


( 11 ) 
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If tln-re are n observation equations, equation ('ll) may be 
written in the form: 

v 1 _ Y 

A ~A Z X z =0 . (1-) 

from which the most probable value of A may be readily calcu¬ 
lated. 

nA-L^zt - O 
or 

.03) 

Equation (13) is identical with the equation obtained by mini¬ 
mizing residuals of the type, V -* reported in the writer’s 
earlier study (loc. cit.). The development given in the present 
paper is perhaps the better from the purely mathematical point of 
view since it involves nothing more than a systematic weighting 
of the observation equations. It can be applied to any problem 
in curve fitting if the standard error of each observed value of 
the dependent variable is known or can be deduced from a priori 
considerations. .For example, it often happens that the means of 
replicated measurements, rather than the individual measurements 
themselves, are used in fitting the curve. In such cases the stand¬ 
ard error of each mean may easily be calculated, The reciprocals 
of the squares of these standard errors will then be the required 
weights of the observation equations. 

However, if the standard errors are proportional to the values 
of the dependent variable, it may he desirable to retain the con¬ 
cept of a relative residual. The significance of a percentage error 
of measurement probably can be appreciated by many investi¬ 
gators in various fields of research, particularly those whose con¬ 
tact with mathematics is more or less incidental, to whom a system 
of weighting would seem somewhat artificial and arbitrary. 
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In either event, the necessary computations are identical. The 
precise procedure described in the present paper, like that devel¬ 
oped in the writer’s previous study (loc. cit), cannot be applied 
when the equation which is to he fitted contains more than one 
undetermined constant. However, in actual practice it is usually 
sufficiently accurate to substitute the square of the observed value 
of the dependent variable for that of the corresponding most 
probable value in equation (9). If this is done, the method can 
be applied to any equation which can be fitted by the method of 
least squares. Using this substitution is equivalent to expressing 
the errors of measurement as fractions of the observed values of 
the dependent variable when the standard error of each measure¬ 
ment is proportional to the quantity measured. 

a■ 
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I. Introduction. 

It frequently happens that all of the individuals of a sample of 
statistical data from a multivariate population are not observed or 
classified with resect to all of the variates. If a sample be repre¬ 
sented in matrix form by allowing the rows to represent the in¬ 
dividuals and the columns to represent the variates, then the matrix 
of the type of sample with which we are concerned is incomplete 
in that some of the elements are not present. As an example of 
a fragmentary sample of this nature, we may consider a series of 
measurements taken from certain parts of a group of human 
skeletons from some archeological find, in which some of the parts 
under consideration are missing from some of the skeletons. 
Again, we find such a class of samples in the social sciences and 
government statistics arising from incompletely answered question¬ 
naires. 

In dealing with fragmentary samples, it is important to have 
at hand techniques which will enable the investigator to extract 
as much information as possible from the data. This is especially 
true if the data are unique or expensive. An important problem 
in this connection is that of estimating the population parameters 
from the sample. 

In this paper it is the purpose of the author to investigate in¬ 
complete samples from a normal bivariate population. To be more 
specific, samples are considered from a normal bivariate popula¬ 
tion of x and y, in which s of the items are observed with 
respect to x and y , rrj with respect to x only and n with 
respect to y only. In the first part of the paper we shall con¬ 
sider various sets of simultaneous maximum likelihood estimates 
of the population parameters and the limiting forms of their 
sampling variances and covariances in large samples. In the sec¬ 
ond part we shall consider other less efficient, but simpler systems 
of estimates. 
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II. Simultaneous Estimation by the Method of Maximum 
Likelihood. 

Let a sample to of /V individuals be drawn from the popula¬ 
tion of the two variates x and y whose distribution is given by 


ZTicf^VFr^ 


/ \(x-a) z (y-b) z _ 2 rCx-aXy-b) 1 

ia-r 2 -) _ &x#y J 


where a and b are the means/ and cr y the standard devia¬ 
tions and r the correlation of and y in the population. Let 
u) X y he the set of 5 individuals of this sample observed with re¬ 
spect to x andy , co x the set of m items observed with respect 
to x only and u)y the remaining n items observed with respect 
to y only. To avoid trivial results we shall assume that s is not 
zero. Furthermore, we shall let ? and ry be the variances'and 
J the convariance 1 , 2 and y the means of a: and y ino^y. 
The variance and mean of x in cJ x will be denoted by u and 
2, respectively, and similarly, the variance and mean of y in 
cJy will be v and y / .The joint distribution of x , y , ^ . 

, y t , u . V . ? , T) and $ can be written from several 
v ell knowm independent distributions as 


0 ) 


-S-m - s-r? - s - f Z+(x-<2) 2 . rt + fy-t>) 

F~K(o x ) (a y ) (l-r z ) e 2<1 ' rZ) \ 

-Zr !^W] 

x y 4 ^ x 

U^v 


where 




fm) ) L S £ 


m+n+Zs 

- 2 - 


(„iir( 1 2 L )r( ja r L )r(¥)r( I ir) 


1 11 , for two sets of variates. t ti , t iz , * - * ,n » ^22> * * * ^2/7 

we let tj 'fjL f tj t and V/* (tji - ~t k ) ,(j t k = l t Z). 

then / and ^ are the means, y, and </, are the variances and is the 

covariance of the two sets of t ‘s. 
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The likelihood of u) when u) is specified in terms of the 
foreeoii’G statistics is given by (1), We shall use this expression 
of the likelihood to obtain approximations for the maximum like¬ 
lihood estimates of the \>opulation parameters , cr y , r , a 
and b . Following Fisher-*, we shall take the logarithm of (1) 
and denote it by L . For convenience, we shall, once for all, set 
tip the billowing set of first derivatives. 


(e,) 


daa. 


8 f (it-a) 

o* YdJl-r 


*Cx-a > r(y-b) 


V 


O v (l-r*) 


/3('y, -b riZ~a)_ 
y la v O-r 2 ) * cr y ' cf x n~r z ). 


(eJ^L *±\ 

bb o y L 

zb*' y 

fe , dL 3 T 5 r / l n ZrS V 
1 5 t>r l-r*[ r +a x <,~l-r*(oZ *o*~ 0*0 ) 


da y "yl 


/ 7 r ? 1 /5r 

2 ( J 2 <?„ <*,/' + o'. 




where Z~Z+(x-a)*, rj *yy+(y-b ) z , a=u+f£,-a) z 
v^v+fy.-b)*, S = S+(2-aXy-b), +=j,/S*-j- 


In order to consider the limiting form of the sampling vari¬ 
ances and covariances of the maximum likelihood estimates, we 
shall need the matrix of mathematical expectations of the second 
derivatives of L with respect to the five population parameters. 
This matrix of expected values turns out to be, 

,-’ K - A- Fjsher, The Mathematical foundations of themetical statistics. 
Philosophical Transactions of the Royal Society of London, Vol. 222 (1922). 
pp. 309-268. 
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dL dL dL_ dL_ j$L_ 

dz, dn t <52, dz x dst s 



where z, is identical with a , b ,0^, Oy, and r as l takes tht 
values 1, 2, . . 5 respectively. Again, •<- and 6 denote the 
ratios and which we shall consider constant ass+ou . 

The maxi m um likelihood estimates of any number of the five 
population parameters for given values of the remaining paramet¬ 
ers are to be found by setting the corresponding first derivatives 
in (2) equal to zero and solving the resulting equations simultane¬ 
ously. In most of the cases of practical interest, the solutions must 
be reached by approximation, in this paper we shall consider the 

following cases: 

1. Estimation of a and b for given estimates of a x , dy 
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and r . 

2. Estimation of cr and <3 y for given estimates of a , b 
and r . 

3. Estimation of < 31 , and r for given values of a and 

✓t y 

t> . 

Before proceeding with the maximum likelihood estimates of 
these parameters we shall consider the notion of the efficiency of 
a set of statistics designed to estimate a set of population para¬ 
meters. 

1. Joint efficiency of a set of estimates. 

In order to attach an economic value to a sample and its in¬ 
dividuals, Fisher® has defined the reciprocal of the variance of a 
maximum likelihood statistic w of a sample from a univariate 
population as the amount of information contained in the sample 
relative to the population value of w . For large samples, in which 
the distribution of w tends to normality, this quantity is a con¬ 
stant multiple of the number of items in the sample. The amount 
of information contributed by each member of the sample can be 
found by dividing by the number in the sample. 

We can extend the idea of amount of information relative to 
a system of population parameters contained in a sample by con¬ 
sidering the reciprocal of the determinant of the limiting values, in 
large samples, of the variances and covariances of the maximum 
likelihood estimates of this system of parameters. This extended 
definition also holds for systems of parameters estimated from 
multivariate populations. 

The reason for adopting this determinant as the extension of 
the idea of the amount of information relative to the set of para¬ 
meters under consideration, is apparent when we note that the 
square root of its reciprocal enters as a multiplier in the asymptotic 
normal distribution of the maximum likelihood estimates of the 
parameters in the same way that the square root of the reciprocal 

8 R. A. Fisher, Statistical methods for research workers, third edition. 
Oliver and Boyd (1930) pp. 266-270, 
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of the sampling variance of the maximum likelihood estimate of 
a single parameter enters as a multiplier in its asymptotic normal 
distribution. 

Fisher 4 has shown that for large samples, the maximum like¬ 
lihood estimate of a population parameter is distributed with small¬ 
er variance than any other statistic designed to estimate the same 
parameter. In the case of a set of parameters, the determinant of 
the matrix of limiting values, in large samples, of the variances 
and covariances of the maximum likelihood estimates of the para¬ 
meters is smaller than that for any other estimates of the same set 
of parameters. 

To prove this, let us consider a set [ p, | • * n) 

of population parameters, and let the set 1 1 L J be their maximum 
likelihood estimates, whose sampling distribution for large samples 
is 

(zrt)i e ' 

where //- j h LJ j , where h LJ = - EL and L is the 

logarithm of the likelihood of the sample. J H is the reciprocal of 
the matrix of variances and covariances and covariances of the i f s. 
Let the set £ u L j be any set of estimates of [ p L j in which at least 
one a is not a maximum likelihood estimate, and let the asymp¬ 
totic normal distribution of the u’s be. 

VR e ~ L 4j-i (Ut ~ p ‘ )(U J ' p J } 

(2TT)% 

where H- / k t j | , which is the reciprocal of the determinant of 
the matrix of variances and covariances of {u ( ] . Now, oui 
problem is equivalent to that of showing that H >K. Suppose 
there is at least one set of estimates of £ p L j containing at least 

*R. A. Fisher, The theory of statistical estimation, Proceedings of the 
Cambridge Philosophical Society, Vol. 22 (1925) pp. 700-725. 
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one estimate which is not a maximum likelihood value, such that 
the reciprocal of the determinant of its variances and covariances 
is greater than or equal to H. Let this set be [ u t ] . Then, by 
hypothesis, K-H. 

Let T be any linear transformation u-p-^Za^x^oi pure 
rotation of the axes representing a t -p t - (■ n) about the point 
Pj'Pz, • • • Prj a s origin, which will reduce k-jfupP ; Xuj-pj) 
to a sum of squares, jfc f x*. Here we have * j^ k L j a t ^ 

<zy^ and = *6 b^ (ui -pj) , where b c< is the cofactor of 
in jay l . Then is the reciprocal of the variance of the 
variable u_, =£ b-u ■ about its mean value p. = Bb-.p,-, and 
Tj f r K , since the determinant ( a.' L j | of T is unity. But 
is not the maximum likelihood estimate of , since at least one 
of the us is not a maximum likelihood value. As a matter of fact 

77 - __ 

ZL ^i< > say, is the maximum likelihood value of p^ , for 

dL * dL dp L * dL 
dp K i-i dp\ dp K uf^dpi 

77 

vanishes only for p L » , that is, for p K -JL > (provided 

we assume that $-p. = 0(1=12, • • -r?) has the unique solution p- = ). 

It follows from Fisher’s 5 proof for the case of one variable, 
that the reciprocal q of the variance of is greater than k^. 
Hence, Tf a > 17 k. We note however, that the maximum like- 
lihood estimates [ are not independent, for their distribu¬ 
tion is 

K, \& „ 

Un)* 


n 


where h^L^h^a^aj^ , which is not necessarily zero for jS . 
The effect of this non-independence is to introduce a term as a 

77 /c 

multiplier offi^ cp^ , where t< is the determinant of correlations 
among the f's, and is less than unity. Hence, 77 > TT q > 

cW 


5 R. A. Fisher, loc. cit 
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Tt kfK- ^ is well known from the theory of quadratic forms that 
the matrix || h </S |j is found as the product || a £ y || * || h L j || * || a t j || , 
where |[ a ^ || is \^lj || with its rows and columns interchanged. 
Since the determinant \ c £ j\ is unity, it is clear that j | * 
which is equal to 77 ^ , has the value J h £ j j which is H by 

definition. Therefore we have H>K y which contradicts the hypo¬ 
thesis that K2 H. Hence, we must have K< H . 

Thus, the proposition is proved that the reciprocal of the de¬ 
terminant of variances and covariances of the maximum likelihood 
estimates {^] is smaller than that of any other set of estimates, 
all of which are not likelihood values. 

We are now provided with a means of measuring the joint 
efficiency of a set of estimates in utilizing information in the sample 
relevant to the population parameters estimated by the set. We 
shall take as a measure of this efficiency the ratio of the reciprocal 
of the determinant of its variances and covariances to that of the 
set of maximum likelihood estimates of the same parameters. 
This quantity is less than unity, as we have just proved. -The ef¬ 
ficiency of ] is therefore 


(4) 




2. Simultaneous estimation of a and b . 

We shall suppose that satisfactory estimates have been ob¬ 
tained forc^, cfy and r . If they are to be taken from the 
sample co , we can take o* as the variance of the x’s in uO X y and 
co x , a* as that of the y’s in co^ and and r from u> X y . 
In any case our problem is to find the optimum values of a and 
b for given values of & , & and r . These values of a- 
and b are found as the solution of the equations obtained by set¬ 
ting (e f ) and (e z ) in (2) equal to zero. Accordingly, we find, 



popula ru '.\ par nr nr nits 


17Z 


■ S') 


icp 


(1+0)2 + 4Xj /1 

/ •*_ r^) /r ^ 

_ C "* 


*; <r* 7 






fy, -y) 




where A 


1 .a J (1-r•) [ 




l + °c+j3+*/3( 1- r 


■>i- 


The matrix of the vaiiance* and covariances of a and B 
’n samples i.s obtained bv taking the reciprocal form of the two 
way principal minor in the upper left corner of the matrix (2)*’. 
Thus we find, 


o*[l+8(l- 2 )\ 

r °x a v 

sD 

sD 


Cfy [l+<(l-r*y\ 

sD 

sD 


where D = 1+ * +/3+4,/5{l- r z ) 


We note from (6) that the variance of a is 

• r z - [l+/3 {l-r 2 )] 

L 3 = s[U«,+/ 3 +*c/ 5 fJ-r*)] ’ 

«iv[ a similar expression holds for . The correlation coeffi¬ 
cient of a and T> is 


r< ^ = {iL+/3a-r*mi+«,a-r*n)i 


6 Sec Karl Pearson, On the influence of natural selection on the vari¬ 
ability and correlation of organs, Philosophical Transactions of the Royal 
Society of London, series A, vol. 200 (1900), pp. 3-10. Here Pearson gives 
a method of obtaining the variances and co-variances of the variates in a 
normal multivariate pr/.abjitj function* 
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Prom the definitions in section 1, we find that the amount of 
information in to relative to a and b is the reciprocal of the 
determinant of (5). That is, 


( 7 ) 


A Cm, v, sj = 


s z +s(m+n)+ mri ( 1 - r z ) 
°*Cy(l-r z ) 


From (7) we can find the relative amounts of information 
contributed by members of co xy , u> x and to y by means of dif¬ 
ferences. For given values of ~n and s , we have as the informa¬ 
tion contributed to co by an mi 1 st individual of io x , 


(7a) 


A (m+I)=A(m+l, 77 , s)- AC 777 , 71, s)= 


s-t 77 (l-r z ) 
o£o* (l-r z ) 


jwhich is independent of rn. A similar expression holds for the 
jii 1 st member of co y . 

For the s+lsT member of u> xy , for given values of m and 
'rj , we get 


( 7*0 


cO 






m+77+2s+l 
<7*0* (l-r 2 ) 


" It is dear that an additional member to is more inf or- 

V CT7HT1+1) rZ 

mative than one to each of co x and coy by an amount ^z^z(j. r z)y 
or, considering the ratio rather than the difference, we have 

1 A(m+1 , 77//, s)-A ( m, n i s) rV m+n+l) 




Aco xy Cs+ 1 ) 


~ ^ ~Z~S~+ 777 j 77 TJ) 


ir We find that the amount of information introduced by t o x and 
<y y is A(nins)-AfOO j sX which is its 

ratio to the total information (7) is ^ y 


S* + s(rn+n)+ mr7(I-r z ) 


1 - 
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3. Simultaneous estimation of <3 X and <?y. 

If we suppose that r is given as well as a and b , we can 
find the optimum value of O x and by solving the equations 
obtained by setting (e ?/ ) and (e 4 ) in (2) equal to zero. Ac¬ 
cordingly, we get, 



_ 2Ef£F-G z ) _ 

ZEF{l+*)-G -/S)+ V4 BPG z (1-kXF/5)-jSJ z 


° y ~ Z EF(J+/5)-G z f/S-*)+ V4EFG i Cl^)(l*/3) +G*(*-/$)* 
where E=*u + j Z j r z > F:0v+prn and G‘0r • 


The variances and covariances of and <5, are given by 
the reciprocal form of the matrix obtained by striking out the last 
row and column from the third order principal minor in the lower 
right comer of (3). For the variance of a x , we find, 

iz _ c^f lZ(l+/3)~ r z {2/3+1)-] _ 

Zsl2(U<XU0)-r*fa+0++/5)li 

A similar expression exists for • The amount of information 
yielded by a? relative to and under these conditions is 


( 9 ) 


4(rn+$Xr)+s)-Zr z l.STrusr]+ZrT)n'] 
A (rri^n\sj= n rZ /71 n 2 ) 


0*0* (J-r*) 


From (9), we find the differences corresponding to (7,a,b,c) to be 


(9a) 




a*a* (l-r z ) 


(9b) 



(s+l)= 


~ cr 2 (l-r z ) ' 


(9c) 


A'(m-fl, s)-Afm, ?j s) 

A ^ y fe*l) 


r 2 frTHn+2$+2) 
4s-tZ{m + ru-l)-r z fm-in) 
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4. Simultaneous estimation of a , O' and r . 

^ y 

Let us suppose a and b to be satisfactorily estimated. For 

large samples, a and b can be estimated from the sets of x's 

and y’s obtained by pooling <*> , oi and co . Whatever esti- 

jcy x y 

mates we may choose for a and b , our problem is to solve the 
equations obtained by setting (e 3 ) , (e 4 ) and (e s ) in (2) 
equal to zero, for & , a and r . 

Xf y 

If we denote the quantities in the brackets of (e 3 ) , (e^) 

and (e f) by f , g and h respectively, then we are to solve 

the equations f=g*h^O for cr^ , & and r . The method of 

elimination seems to be of little value in solving these equations. 

Then we shall use the extended form of Newton's approximation 

method and find an approximate solution. Considering nothing 

higher than the first order terms of Taylor's expansion of f,g 

and h we have (letting cr , cr=y , ), 

^ y 

+ 

f, + (z-x,)f^ +(y-y t )fy f + =O 

(10) - g i + (x’X / ')9x t +(y-y')gy+(ss.--z t )g Zf *0 . 
h f +(x-(y- y j )h y ^ *0 

where z y ) f anc [ s0 on . 

We shall take for the initial point, ?c t =* , y f * and 

s, - , which are, for m=rj = 0 and a=x, b=y , the maxi¬ 

mum likelihood estimates of a ' , cr and r from a) . 

y 

Solving equations (10) for-*, y and z by Cramer's rule we 
find -for the first approximation beyond the initial values, 
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where Z>= 1 (l-p 4 )™ A p =^J== . By using the 

point wlio^e coordinates are given by (11) in place of the initial 
point in (10)*we find a second approximation point, and continu¬ 
ing the process we get a sequence of points. Such a sequence 
would raise questions of convergence which will not be considered 
in this paper. However, it can be shown without much difficulty 
that the likelihood of the point whose coordinates are given by 
(11) is greater than that of the initial point for variations of u, 
and v about f and rj respectively, and for u * f and v = ry 
the likelihoods are equal. Indeed the problem is equivalent to 
showing that the ratio of the likelihood (1) with the values , 

JTj and for , cr y and r to the likelihood with 

the values given by (11) for a x , cx y and r has a maximum of 
unity for variations of UL and v about £ and ry . This can 


be readily done by examining, in the ordinary manner for maxima 
and minima, the first and second derivatives with respect to a 


and v of'the ratio of these likelihoods. 


The matrix of limiting values of the sampling variances and 


covariances of the maximum likelihood estimates of CT X , <fy and 
r can be obtained by taking the reciprocal form of the third order 
principal minor in the lower right hand comer of (3). This reci¬ 
procal matrix is, 

( 12 ) 


cr 


ITS 

ZsE 


(l++(l-r 4 ‘)) 

ZS£ 

zsa 

rc x (i-r*)(l+6(l-r*)) ra y (l-r*Xl+4(l-r 2 } 

2s£ 

ZsZ 


ZsZ 
a y (l-r z )(i 


ZSE 


ZsZ 


where + 1-rV . 

The amount of information in oo relative to , a y and 
r is the reciprocal of the determinant of (12). Denoting this 
quantity by B ( m t s ), we have. 
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4 

(13) Bfm t n,s)= z z j— I s 5 +s z (7r)+n)+mns (1-r*)\ 

Ox 0y(l-rV* L J 

Proceeding as we did with (7a), (7b) and (7c), we find the 
following incremental contributions * 

(13a (l rZ)5 \**4.an(J-r*j\ 

(13b )3 U> (s+l) = a z o*(i-r z ) 3 t 5 s z + 3s+l*(Zs+lXm*-n)+wv(l-r 4 )^. 
xy ^ y 

(13c) 3(m+l i 5 )- 73 s) = o-z^a*- (l- r z ) 3 \j?8*-+sfa7+n+l)(l-r*)[ 

x y 

( 13 ( 1 )^( 777 , 77 , s)-BfO,Os)= <7*o*-(x-r z P mnsQ-i*^. 


We note that the s+lsi member of co^y is much more im¬ 
portant than an additional item to each of and to., when n , 
Oy and r are considered, than when a and b are considered. The 
amount of information contributed relative to r by and coy 
can be found by differencing the reciprocal of the element in the 
lower right corner of (12), with respect to w and n. If we call 
this reciprocal K^rn^s), we have as the ratio of the contribution 
of information by u o x and cOy , to the total information in co re¬ 
garding r, 

K re (r77 t n t s)-K re (O t O f s ) z s(/r?+n)+rrr?7jfl-r z ) 

K re (n 7 y r? y s) s 2 +S (m+r?)+ mn ( 1-r*) 

In a similar manner, we can find the contributions of the various 
parts uD X y , u) x and a)y to the information relative to any one 
of the parameters o' , a and r and we can find their effects 
upon the covariances of the maximum likelihood estimates by con¬ 
sidering the non-diagonal elements of (12). We find that the m- 
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formation afforded by cJ w relative to ov expressed in terms of 
y <*■ 

the total amount of information in ul v/ , oJ. and cJ regarding 

x.y x y 

°x’ is _ r 4 su _ 

s z +s(m+n)+mn(l-r+) 

We remark without going further, that, by considering the 
five equations obtained by equating each of the expressions in 
(2) to zero, we can find approximations for the maximum like¬ 
lihood estimates of a,b , <X x , 0 y and r by the foregoing method. 
Since the process is straightforward, though somewhat cumber¬ 
some in that it involves fifth order determinants, we shall nc* 
consider it here. 

III. Systems of independent estimates. 

We have seen that the problem of finding the maximum like¬ 
lihood estimates.of cl, b, cr x , cf y and r from the sample leads 
to expressions which are not very simple, especially from the point 
of view of practical application. However, the variances and co- 
variances of these estimates were found to be relatively simple. 
In view of the difficulties connected with the foregoing maximum 
likelihood estimates, we shall devote the remainder of A this paper 
to a consideration of the moments, distributions and efficiencies of 
simpler systems of estimates. 

If we are interested in the means of the xs in to apart from 
any contribution of the y’s, the optimum value of a is • 

Similarly, for the means of the y’s, we have y o = . The 

best estimates of the variances , and 0^ under these con¬ 
ditions are, 

\_S?+7r7U + sC2-X 0 f+ 777^-Zj*] 

\s>7+ r)v+s(¥-J 0 ) z + S7fr,-y 0 )*j , 

where N, = s+ nj, and N z =s+n. 

For the covariance we shall take the product moment of the 
deviations of the-e’s and y’s in from x o and y o respective- 
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ly. That is. 



-*J( Vi-y 0 ^ s+ (x-2J(y-y 0 )• 


From these values, we can take as the estimate of the correlation 
coefficient, 


n, 


Jo _ 


1. Distribution of and y 0 
The variances and correlation of 


and y 0 can be found 


from 


Vw?75 


(ZnJJcr^i/TJz 


_s_ 

' zIFr*) 


~S* 


’Cc-*f Cy-bf 




by making the substitution ,an< ^ us i n & 

determinant analysis 7 on the symmetric matrix of the resulting 
quadratic exponential. 2 z 

The variances of x Q and y Q are found to be and ^ 
respectively, and the correlation between and y 0 is - 

The exact distribution of and y Q is normal. 

The amount of in formation relative to a and b furnished 
by S 0 and y o is, according to our definition, 


(14) 


( S+ 7T})(s+ n) 


a z a z f 1- _ 

a x°yY (rr?+s)(T7+s) 


■] 


The efficiency of and y o is, therefore, the ratio of (14) 
to (7), which is 

_ (777+ s)*(r»s) z (I- r z ) _ 

[ (m+sXn+s)- rr?r?r 2 }[(m+s)(rr+$)-s z r 2 ~)' 


7 Karl Pearson, loc. rit. 
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2. Characteristic function of ? c , v 0 and 5 0 . 

The characteristic function or generating function of the 
moments of , T) 0 and , which we shall denote by <p(fe e), 
is defined as the mathematical expectation of e t Since 

* Vo an( ^ are expressible in terms of x , y , x f , y ; ,u, 
v, £, r) and 5 , whose distribution is given by (1), then clearly, 
we can write, 


(15) cp(r i S,£)‘Je r ^ +6y} ^ e:S o fr dx /. 


Where F is given by (1) and dV is the product of the differentials 
of the variables in F> and the integration in taken over all possible 
values of the variables. 

The integral (15) can be broken into the product of a con¬ 
stant by a quadruple integral, a triple integral and two single in¬ 
tegrals. The quadruple integral is of the form 

4 



t> u (h’C)(tj-ci) 

fa lJ 1 1 J J di,di z di 3 cH+ 


TT Z 

which has the value 8 , where A is the determinant | b L j | 

% 4 ) and b L j =b JL . The triple integral is of the form 

f *> r o, f Vxy -(c„z+c zz y + c lz *) 3=4 

e fey-2 z ) didxdy 


Yxy 


which has the value 9 , - 

Vrfr^)r(^) 

5-i 


& Karl Pearson, loc. cit 

°See V. Romanovsky, On the moments of the standard deviations and of 
the correlation coefficient in samples from a normal population, Metron, vol. 
5, no. 4 (1925) pp. 3-46. 
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Each of the single integral's i\ of the well known form 

-c 2 t _ 

J t c di 

r~ ( ) 

which ha^ the \alue ■ . 

Using the above results tor the integrals into which (15) 
resolves itself, we get, 

(U>) ,v n sd K -'ZLl . _ dzl 

<p(ts,£)~A * B Z (AB-C z ) Z (A-^) 4 (b-£ ) ~ 




<£•* / r z m z n z 

+ 4s z ' \ Z 


" \A/ Z 1 Zso^N \ z N*AJ 2 J’ 


where A = -— 5 - , , /4 =—— 


_± _ g _ ± 

fCJ-r 2 ) ’ ‘2a*tf-r*)' 

- y 


Zo^w 


hH a* si 


If ue write M(h,Kj) 'JpTJJTJp <?(**,£)L 


6*E-0 


we find the following expressions for the first few moments of 
• <?„ anti 3 0 , 

Wl.0,0)- %' aj, WO.1.0)■ 2!j4 ef 


M(0 } 0,1) - re* ^ s ^ s77pv z ) 

Maxm- ^ 

(17) ! ~17T L(N ' + 1)(^T7- z + 3 - 1 ) 
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M(Z,0,0)~ > M/0 l Z,0)‘ 


~~W~' 


„ m) , 




raj&j [4fs-lYl*r 2 ) 4mv /. mn 


N,N 2 


* 




( N, N, J 


.HLfJTLH ^-i)V— 

s ' N,N Z / ls/V,/V z 


77777 * ~V N z + N* N - 3 


S V, N t N z 

If the sample u> is fairly large, we can neglect the contri 
butions of the means x,p,2/, and y, to , rp o , 3 0 and con 
sider as satisfactory estimates of the variances and covariance, 


J _ %+* u 

L+* 




r}+/3'v 
= 1+0 9 


5- 5 


4 

y/ 


3. Characteristic function and sampling distribution of £ 0 , 
and 5^. 

It is clear that , /% and ? are obtained from £ , yy 0 and 
by dropping the terms involving the means x , y , X / and 
. The characteristic function ft6,8) oi these statistics can 
be obtained from (IS) by replacing % 0 , ry 0 and S 0 by % o , Q a and 
and integrating. The integral in this case will not involve the 
quadruple integral, but only the triple integral and the two single 
integrals. Accordingly, we find 

<p(/,£,£)= A B-C z )^(A -) "*** 

x (B\) ^ [(A-% )(B- xJ-(C+i- s f ] ^ 

which is somewhat simpler than (16). 

The first few moments of % a , r/ a and 5 0 evaluated from 
€J are (using the notation of (17), 
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Mfl'Ofi). ? l (N <" Z) , M(0,1,0)* ZyS** Z) . 

/ v 


A/r< 3 <?/> ^ir r ** a ? 


M( 1,1,0)* J*fy. f* 2r z (s-l)+(N,-Z)(N--Z)\ 

N t N z \- J 

Mil,O.D- MfO.V). 

(»> „UAOh S2£S’ 


M(o,OZ)* 


o/op(s-l)(l+r z s) 


M(1,1,1)* \z(N^N z -Z)*z(s^l)r^(N r zXfl 2 -Z^ . 

In order to find the exact sampling distribution of % o , 7} o 
and it is more convenient to consider the statistics ?* j' ^, 
= 4^ 9 and ’5,-5,. The characteristic function of these sta- 

'0 S 'o yy/ yy 

tistics is found from (pfys, £) by replacing —' / by Y } ,-j-Vby 
<5*, , and <f by <f 7 . Thus, we have 


(19) <p,(f r 6 r e,).A*B, b (A,B r C*)(A,-()%M)%yfy-S)-(cA)^ 


where A, , &, . A, , 3 , and C f are the constants 4 ,3 , A , 3 
and C each multiplied by s , and a* and c= . 

The distribution )of ^ and 3, is then, the solu¬ 

tion of the integral equation. 


m[77 ?A * r, *'+ Sl * , * £ '*' f(? r O r S l )d^do l dl l ‘ep f (/ l .6,,£: i ). 
J ° J ° 
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We note fiom 1 1«) that the factor (A,- if, ) can be 
wiitten as (ArK^f 1 ’XT*) * ancl likewise with respect to (0 r S/* 
For sufficiently small values of f, and 5, these terms can 
lx? represented bv senes expansions. It will be convenient to re¬ 
arrange the product of these two scries in a power series in r z . 
Expanding and arrantjim* in this manner we get 

(2i 

'£ T7 '4 <‘> r b“ -jMtojttifc) ‘(ffr)' • 


Each term of this expansion is of the form 

( 22 ) <P K fr„6 r £,b GjArO**(Brij bk ^ f (XB t -S,)-(c,*p*\ 


where G k is a constant independent of 3f, S and £ { . 

We are now in position to find ft 'O n ^ l ) as a wics of 
terms f k ( f fj q {t S t ) whose form is given as the solution of 


HI * 


'r,? t +*.o t * e , s , 


■f k (t,.9 r '$,)df l drj l <iS*<p i( (i' ll 6 ll £ l ) 


'J'he integral conation (23) can he solved by the methods used by 
Romanovsk_\ ,u . Following Romanovsky we find that 


(24) 4 (? ll0r s t ) m6tf e ~ A>a S c -i ^ c ±(^) 

where u) fy£ “is an even function of — = f , say, which satis¬ 
fies the condition 7 7 


1W V. Romanovsky, loc. cit 
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(25 )J t z 9cO(t)dt‘ 


r(g+j)r(c+g) _ 

r(a k +c+g) r(b^ c+q )nj) r(c) 9 


for g =0,1,2* . . . and u) (t) is independent of g . 

To solve (25) we observe that the right side can be written 
as 


(26) Mg sHJ'Ju g ~*(l-u) ak + c ~ ^v c * 9 l (i-v) b *'‘dudv, 

c o 

where//* TyJjr(c)r(b k )’ffap-J) * The^-th moment of t z \s now 
identical with the g -th moment of the product uv Since <x) (t) 
is even, we have, 


(27) jt z< >o>(i)di=±M 9 . 

Setting v , dv in (26) we find 

(28) co(i) a ** C ~ (1-u) bk ' 1 da . 

Making the transformation jrf r z s 6, we finally obtain, 


(29) a)(t)= 


a-tv 


a *+4k' t ‘ c ~'k 


r(% ) r(c)r 


—> b k> “k+tpH* ‘~ tZ \ 


where the F function is the ordinary hypergeometric series. Us¬ 
ing this form with t replaced by, in (24), we have f^ r 7? p 
fully determined. tV> 

The complete solution f( (20) ^ ^ ^ oun< * 
by summing all of the expressions of the form (24) whose char¬ 
acteristic functions appear in the sum (21). Without much dif¬ 
ficulty we can stun this series by expressing the coefficients as beta 
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functions and interchanging the order of summation and inte¬ 
gration. Accordingly, we can express f ( £,, O r ) in closed 
form as 


(30) 


_ -A^-B^+ZCSa+c-i b+c-l za+b+c-\ 

Ke X, 0, u~s=' 


0, 




J J 




O 0 


x e 


Z(l '(v, K v,x ' B> + *!-}-**/ jy) dxdy, 
K,v/ 


where g- tfB, b (A&O c _ 

r(i)r(a)r(b)r(c)r(c-i) 


The distribution of 7] c and can be found by the 

change of variables f/ = j* ? 0 » V, s is? Oc » % * % • ^ * s c ^ ear 

that our estimate t Q - of the correlation coefficient can 

__ 

Tange in value from to J N, J^ * 

4. Moments of £ o , f) c and when r«0. 

The general product moment M(tyk t A)0 o ^ob¬ 
tained from (30) for is extremely unmanageable and im¬ 

practical, since it is a generalized hypergeometric series expressed 
by a quadruple summation. However, for r*O t tf(h t k t t) is quite 
simple. Indeed, for this case, we have, 

„ Kr(a)nc-i) 3 h+k rrr^fl, r' e * 
(31) M(h,k,t)= r(a+c .l )N n N « J o J 0 J-V^rp, l 


^a+c+h-i b+c + k-% jV r 2 -ja+b+c-pe 

,f . ”, r ' (J ~W 


x(l-0) a * e *0 ~^ l d9dt l df,d9 l . 


M(h ) k 7 t) exists for all positive values of h and k and for all 
^positive integral values of /. Since the integrand is an even 
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function of it follows that M(h } k, t)*Oior t an odd integer. 
If we let set in (31) and make use of (25), we 

find, 

M(h 

/ nn,K f ^v/ a/^n* 

' * 

' r(b+c+k+i))r(a+c+h+i/)r(\t+-L)r (c-rU) 

x r(b+c+vjrfa+c+7/> rfc) 77J) ' 

The2i/-th moment of the correlation coefficient can be found 
from (32) by letting Thus, 

M , j / a ; a /„ ) v _r(^-hi)r(c+-i/)r(a+c)r/b+c) 

' z-j (ro> ~[ sz / r(a+c*T/)r{b+c+T/)r{c)r(-L) 

The variance of r 0 is cf *=> which does not dif- 
fer appreciably from §zj , which is the sampling variance of r 
when it is computed from aJ^y . The distribution of r 0 is found 
by setting t*r 0 in (2Q) and multiplying by rfa+c) 
r(b-f-c). ' * 

The Z t/ -th moments of the regression coefficient of y on 
x, p « |« say, is 

/, , f y. . „ /N,) z ^/ov'f v ’r(a+c-^)rM+i)r(c+-b) 

M z JphMf-z^o,Mh(^) 77a7±^jf^)rW 


and the variance is 


N, z (s-1) 


Vp- tfjZZXWZJs* ’ <r* 

This differs very little from the variance JTj 
gression coefficient using only the data from . 


of the re- 


Slightly more accurate estimates can be obtained for , 
r z and rcr K a w by multiplying <f OJ % and % by >7vfe 


and jry respectively. These corrected estimates will have their 
mathematical expectations identical with a* , a z and ra x Oy , 
as will be seen from M(1,0,0), M(0,l,0)&nA MfOjQJ) in (18). 
In this case, the general moment M(fyk'Zi/Jwill be identical with 
(32) multiplied by 

s 


/N, ‘) n /A/ z JY 
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The variance for r* in this case is j“j* , and that for the re¬ 
gression coefficient is 

(N r Z)a$ 

(s-l)(N r A)o* 

5. Variances and covariances of > J% ^ 0> 

large samples. 

As we have seen in the last section, the product moments of 
to * % and evaluated from (30) are too complicated to he of 
much practical value, and there is not much hope from this source 
of finding the sampling variance of the estimate r a of the correla¬ 
tion coefficient. The moments and variances of and 
taken separately are well known results. In fact, for large sam¬ 
ples, the variances are j^audj3£ respectively. The variance 
of r 0 . is not so immediately obtained. We shall find its limiting 
form tor large samples from the normal form approached by 
the distribution of , J^ 0 and r Q as rn } n and s approach o o 
in constant ratios , -r . 

For convenience let ^ <p and r 6 =t . Then 

we have 

(33) 

If we integrate (1) with respect to x , 5?, x r and 57 and 
perform the following transformations on the remaining part of 
the distribution, 

= & (1+**)-<KrLL d ^ Ct G 

r)~ <P z (l+fi)-/5d drj ~zcp(l+/3)d<p 

S* te<p 

we cafi write it in the form, 
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(34) FfU'V&wSffuxetjR \ 

where s_ - m+n+Zs 

F(u v t e i 0 t t)~4C(U«,)(l~/3)(l-r*) z a^ m a y Sin e 2 

xu~ z v J S Z f Z j?i*« 9*+*uv)-t i 9 z <fi z J 2 


and 


s .i J12-1 n-L -s-m+z ^ -s-n-t-z ^ 2) 'x 


>/? 




"y 




2 222 - ^- — r(i)r(st f )r(*j?)r(eg)r(t¥) 


and 

f(u, ^ ^ 4, t) =U-r z ) 


‘5 - /-o^ 




^y 




a 



& 

z 


x [f/^4 z -3v)-T z e% z ] z 

_l _ \i + 4,e*-r z *,u 1^3# z -r z /S v 2rtd</>l 

x e 2 ( 1 -rv L ^ J - 


If (34) were integrated with respect to u and v, we would 
get the distribution of and ~t~~ecp 

where the distribution of f , ^ and ? is given by (30). The 
oroblem of finding the asymptotic normal form of the distribution 
of Q , 0 and t from (30) seems extremely complicated. How- 
c\ er, we can find this asymptotic form by first finding the limiting 
nmmal form of the distribution of u,v,6 , and t from (34) 

and then integrating with respect to u and/. 

The limiting normal form of (34) can be found by methods 
developed by von Mises 11 in a paper which appeared in 1919. In 
fact, satisfies all of the conditions of the generaliza¬ 

tion of his first theorem to functions of more than one variable. 
In particular, the first order partial derivatives vanish and the 

a 1 R. von Mises, Fundamentalsatze der Wahrscheinlichkeitsrechnung, 
Mathematische Zeitschrift, Bd. 4 (1919) S. 14-18. 
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determinant and all of its principal minors of the negative of the 

Hessian are positive at the point P whose coordinates are u * O*, 

v=a z , t*r. Furthermore, f is identically zero 

y x y 

outside the region of possible values of u< v, & , <p , and t. 

The matrix of the negative of the second derivatives at "P is 



df_ 

du 

df 

dv 

df 

Te 

df 

W 

df 

dt 

df 




+0r* 

*r 

da 

Zcr2a$p+ 

a/OyP* 


6f 

r**fi 


* A r* 


fir 

dv 



| |l 

bf 

be 


+/3r* 



-&IW 

df 

b<p 


'aju A 5 ) 

- zr*. ft +0 
& pz l 

k&>¥) 


df 


fir 

c x p*\l* P*) 

HS 


df 




Xow it follows at once from von ^ Uses’ theorem that 

. s 


(36)/^ rtWfyu,,'‘Ffeofa o y ,r)e 


y j 


zZ h x x 
L J L J 


where x,*u-o* , v-a z , x 3 ~ 6-^ , = 4>-cr y and x ; *t-r , 

and h L j is the element in the l -th row and j -th column of the 
matrix (35). Now, 

(1+4, Xu 6) 

F ( a x‘ a y> a K- a :\f, r )~ (zrr)io2ay(±-r*)\ 
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which is equal to (&)*& , where h is the determinant | h[j | . 
The variables in which we are primarily interested are Q , 
and t . The matrix of variances and covariances of Q * <fi 
and t is formed by taking the third order matrix in the lower 
right corner of the reciprocal form of || /7 / y|| . This matrix 
turns out to be 



The determinant of (37) is 


o£crf\fI-r*) 3 + (*+/3)(l-r*)+<0d+r^ 


4s 3 a+py 


The variance of r 0 is given by the element in the lower right 
corner of (37). It can be readily shown that this variance is 
greater than the variance of the estimate of the correla¬ 

tion coefficient from aJ xy only—a rather surprising result. 

The efficiency of Q , <$ and t taken jointly is the ratio of 
the reciprocal of the determinant of (37) to B(m,n,s) in (13). 
That is, 


(39 )Eff(e$t)» 


a-r 2 ) s U+*) z U+/3) z _ 

I ~(1+* Xl->/3)-43r+\ [V 1- r z ) 3 +fa +/3\ 


which is less than unity except for the cases r-O and j,=/3*0 . 
6. Efficiency of the system O , <f and • 

If we use j$--= r, , say, which is the maximum likeli- 
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hood estimate of r hum X L . instead of r 0 in section 5. and use 
the foregoing analysis of von Alices, we find the following matrix 
of variances and covariances for the asymptotic normal distribu¬ 
tion of 6 , (p and r t : 

(40) 


e 


(p 


n 


The determinant of this matrix is 

a*c* U-r z ) 2 \(l+*Xi+l5)- j<*+/3+z\ 
1 As 2 U+*)*a+/3) z 


e <p • _ r. 



r 2 a }6 c/y 


Zs(l + >i) 

Zs(l+*)(l+p) 

Zs (i -t ) 


~z 

a y 

rOyU r*) 

Zs(l+4,)(1 +&) 

Zs(U/8) 

z$(U/3) 

ro x (J-r*) 

rjytt -r z ) 

(1 -r z ) z 

Js TT+ZT) 

ZS ( /+&) 

s 


whose reciprocal provides us with the amount of information re¬ 
lative to cr y and r yielded by the estimates 9 , (p and r, . 
The efficiency of this system of estimates is given by the ratio of 
the reciprocal of (41) to (13). that is. 


By comparing the systems &, p , r 0 and 9 ,<f> , r t , we actu¬ 
ally find the latter to l)e more efficient, since 

(!-/'*) *\U_■>■'=<')(1+J3 )- j- ( V v /] ^ 


which is the ratio of the reciprocal of (38) to that of (41), The 
equality holds ony when n-O or 4*/S*0, 
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The distribution f(z,w,t)oi X 0 -z, % =l ^and r,*i can be 
readily found from the distribution of £ , 77 ,Z,u and v, which 
is included in ( 1 ), by making the following sets of transforma¬ 
tions in succession, 


(a) r=zS ¥?, 




(b) 


(c) 


77= l+/3z-j8v 


(j* 


us ’-^r-zfl-0) 


Vs '£*(1-0) 


d $ * 

d7?=l+j3dw 


du * - £ d& 

dv~ - 


The result can be expressed in dosed form as the definit 
integral 


S-4 s+rn-4 s+n-* 

’-tv ~ lr ~ 

/ / „ S~3 7T7-3 S-3 Zti 

K f[e~ 9 e ^(i-d) <p (l-<p dddtp. 


(42) f(z f w,t)*K(J-i*) 

ri ~g *tt 


where 


/V/^/V*-^ M-z N 9 -Z -rt t +z -d z +Z 

„ ri) 1 v, * <r y ( 


_s-J 

X. 


r^)rf^ i )r( s -j?)r(^)r (*f) 
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y z(i-rV L 


+ 


N z \l-r z (I-4»'y' ZrtS N,N z e$Zw 
0y “ 0x m 


When r=0 , (42) breaks into the product nt thiee v\ ell 
known functions, two of which represent the distributions of the 
variances in samples having s+m-Z and s+n-Z degrees of free¬ 
dom, and the third which is the distribution of the correlation co¬ 
efficient in samples of s items from a normal population in which 
the correlation is zero. 

IV. Summary. 

Samples are considered from a bivariate normal population 
of ^ and y in which all of the members are not observed with 
respect to both x and y . Such a sample is broken into three parts 
<J x y , and <Vy , where a) X y is the set of s members observe^ 
with respect to both z and y, the set of m members observed 

with respect to x only and uiy the remaining items observed with 
respect to y only. 

Maximum likelihood estimates are found for the following 
sets of conditions: 

(a) For given values of <?y and r , optimum estimates 
are found for the means a and b . 

(b) For given values of < 2 , b and r y optimum estimates are 
found for and c^. 

(c) For given values of a and<£, approximations are found 
for the optimum estimates of , Oy and /\ 

Other sets of estimates considered are: 

(1) Means a and b estimated independently from the^s and 
the y*s respectively, of the sample u ). 
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(2) Maximum likelihood estimates of CX X from and 
and cr y from u) x y and uJy, each estimated independent- 

■ ly of the other. The estimate oi * s taken as the 

covariance from u? X y. The characteristic function of 
these estimates is found. 

(3) Estimates of ^ and taken as the square root of the 
weighted averages of the variances from u) y y and u > x , 
and from uj xy and u?y respectively, with the estimate of 
rtaken as the ratio of the covariance of o^>y to the pro¬ 
duct of these estimates of the standard deviations. 

(4) Estimates of a x and the same as in (3), with r 
estimated entirely from uJ x y 

The exact forms of the sampling distributions of the systems 
in (3) and (4) are found, as well as the asymptotic normal forms 
approached by these exact distributions as the size of the sample 
u) increases, subject to the condition thaty^<< and are 

constant. The limiting value of the variance of the estimate of r 
in (4) was found to be less than that of r in (3). 

We have defined the amount of information available in a 
sample relative to any set of population parameters as the reci¬ 
procal of the determinant of the matrix of the limiting values, for 
large samples, of the variances and covariances of the maximum 
likelihood estimates of these parameters. It is shown that this 
determinant is smaller than that obtained from the asymptotic 
normal form approached by any other set of estimates of the 
same set of parameters. The amount of information relative to 
the parameters utilized by any other set of estimates is the reci¬ 
procal of the determinant of the matrix of the limiting values of 
the variances and covariances of this set of estimates. The meas¬ 
ure of the efficiency of any set of estimates is taken as the ratio 
of the amount of information yielded by this set to the amount 
yielded by the maximum likelihood estimates. The efficiency thus 
defined was found for each of the sets of estimates (1), (3) and 
(4). It was found that the set (4) is more efficient than set (3)5. 

J. 



ON THE SAMPLING DISTRIBUTION OF THE 
MULTIPLE CORRELATION COEFFICIENT 

Bv S. S. WitKS* 

The problem of finding the distribution of the multiple cot- 
relation coefficient in samples from a normal population with a 
non-zero multiple correlation coefficient was solved in 1928 by 
Fisher 1 by the application of geometrical methods. In his 
derivation he used the facts that the population value p of the 
multiple correlation coefficient is invariant under linear trans¬ 
formations of the independent variates, and that the distribution 
of the multiple correlation coefficient is independent of all popu¬ 
lation parameters except p . 

In this paper it will be shown that the distribution' of the 
multiple correlation coefficient can be derived directly from 
Wishart’s 2 generalized product moment distribution without 
making use of geometrical notions and the property bf the in¬ 
variance of p under linear transformations of the independent 
variates. . Furthermore, it will not be necessary to show that the 
distribution will be independent of all population parameters 
except p . 

The population value of the multiple correlation coefficient 
between a variate x, and a set of variates x z » * 3 , ■ • • * n is 
the ordinary correlation coefficient between x t and that linear 
function of the variates x z ,x 5) . x n which will make this 

correlation a maximum. It can be expressed as p z *l- , 

A/ 

where A is the determinant of the correlations among all of the 

♦National Research Fellow in Mathematics. 

‘R. A. Fisher, The general sampling distribution of the multiple cor¬ 
relation coefficient, Proceedings of the Royal Society of London, series A, 
voL 121 (1928), pp. 654-73. 

’John Wishart, The generalized product moment distribution in samples 
from a normal multivariate population, Biometrika, vol. 20A (1928) pp. 
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variates x ; , , • - x^ and A, is the determinant of correla¬ 

tions among the independent variates % z ,x 3 , * * x n Denoting the 

sample valueof p z by7? z \t is well known that , where 

Z) andZ} are the determinants of sample correlations among the 
sets of variates x z ,--'* n and * z ,x 3 , * * ^ respectively. 

Let us suppose a sample of N items to be drawn at random 
from the normal n -variate population whose distribution is 


( 1 ) 


(*rr)f 


r? 

~j£ - 4 . ; (x, * /T?;XX . - 177.) 

g * LJ*1 V * 1 J J 


where A ;; = 
v 





the determinant of correlations 


among the n variates, /j,-, is the cofactor of p, -in A , O: is the 

LJ j | * V * 

standard deviation of ^ and /I = J A L j . 

In the sample, let 

/ * 

X; * ± Z , 

* /V *-/ ** 

and 


where is the value of X; for the -th individual of the 

L f, 

sample, Wishart 3 has proved that the simultaneous distribution 
function of the set ^ a -], (L f j=l,Z t • * • -7?)is 


( 2 ) ffoh 




* A 


N-l 

2 


ff 2 






H 


2 J. Wishart, loc. cit. 
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where | a- LJ | is the determinant of the a's. 

We shall define a moment-generating function <p(°L,k)n <, 


(3) cpU,k)=fe^ | K f(d)da, 


where the integration is to be taken over the field of all 
values of the a’s andj^^j is the cofactor of< 2 ;/ in 


possible 


a 




From this definition of <p(*,k \it is clear that (p(*c t k) 


* =x 


is the product moment L will be shown that this 

expectation exists for h=-k which will yield the k-ih moment 
of (1-R Z ) , from which the distribution of /P can be found. 

To find (p^c j k)v7t observe that since (2) is a probability 
function, its value over the field of all possible values of the <2‘S 
is unity. Hence, we must have 


(4) 


/- 


/V y A-n- 


a 


v 


A/-77- 

—r 


da = e. 


where 


G = ' 


77 


a &*r(*&)r( ! ¥i~r& 1 ) 


1 77///-/; 


7F 


This relation 


holds for all positive values of N?ti and for all values of Alj 
which will make the matrix || A[j |j positive definite. 

If f(a) be integrated with respect to a lt , a iZ t • • • a /rj f 
the resulting form will clearly be the distribution of the set of 
as contained in I a pq I and will be 
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.tn-iXn- 2 ) _///-/ 




where B p q is the element in the p-th row and q-ih col¬ 
umn of the reciprocal form 4 of the determinant which is the 
co factor of the term in the first row and first column of the 
reciprocal form of \ A L j j. The value of B p q in terms of correla- 

s\ no 

tion coefficients and standard deviations is-— , §% , where 

CT„clA 0) 

0 ) <0 p 4 

A ~ A and A is the cofactor of Pp^ in A f/ . Further¬ 
more, 1 B&a I. Hence 


Ny A _ 

' ~ p r\, f CL, . 


d- g-l 

i < 


da, <d\f-a\ 


Jtl .±1 daL Bpqapq 

-rrVgJ z (i) r(*i*)e 




where cLa, = da f/ da /JS • * da fJ1 and is the product 

of the differentials of all a.’s in \o>pq\ (p,q*2,3 / * * * * 77 A 

Xow. it is clear that (6) is an identity for all values of 
A/ ami the population parameters^ and Py (L t }=l t 2, • • • m ; 
i /j) , for which both sides of (6) exist. Thus, we can per¬ 
form the following operations on (6) : 

(a) . Replace N by N+Zk . 

(b) . Replace d^ by d L i N * ( l = 1,2, • • • n) 

4 By the reciprocal form of a determinant | c,, | we mean the deter¬ 
minant formed by replacing each element c LJ by the ratio djJL where 
C L j is the cofactor of c lJ and C=\ c t j\ c 



200 


MULTIPLE CORRELATION COEFFICIENT 


(c) . Replace A u by A n -—. 

(d) . Multiply both sides of the identity by 

(e) . Multiply both sides by \<2pq\ . 

Accordingly, we find that the integral of the left side of (6) 
over all possible values of the a's is the definition of <Pfa t k), 
which must be equal to the integral of the right side over the 
field of all possible values of the a's in \apq\ But the value of 
the integral of the right side can be deduced at once from (4). 
Hence, we finally obtain, 


-k N-l 

( 7 ) <P(+.k)*(%) A~ r A H 


N-J 


-k k 


r1 £L jP +k ) 
~n m p r ) ’ 


where A K is the determinant A with A n replaced by A if -~jqr , 
and £It is the reciprocal of the cofactor of the element in the first 
row and first column of the reciprocal form of A ^. 

That is, 

A ”' 1 

( 8 ) 


*4- 




where is the cofactor of the element in the p-th row and 

The value of 


q-th column of A^ , (pj<l=2, 5, * • • r?). 
can be readily found by writing 


*P<? 


lA 


Upq\ 


W/l 


where A e 


exce pt and A <n ’A,,-ufir . 

I A j,pq \to th order determinant by inserting, 

as first row and first column, an additional row and column which 
will not change the value of the determinant, and multiplying it 
by we find 


Increasing , 


Upq 


77’Z 


= Al (A„~w)- 
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Therefore, 

(**- W) ‘ 

Substituting this for 3 U in (7) and using the fact that 
A u , we finally obtain 



Thus, it is evident that <f (a, k) exists for sufficiently small 
values of <4 Let us write 

_ A 

'NA 




1- 




.cu 

2 




and expand the second factor on the right into a Taylor series. 
Substituting in (9), we have the convergent series 


, /ha fir rf*?**) 

k ) = (zaJ ~ 


( 10 ) 


x L 
t*o 


f A/ ' / / 

(fit-l ' ' (ffa-rJrfV+O 




For the coefficient of 'fir in the expansion of the right side i 
(10) In powers of 4,we find 


( 11 ) 


(■ 


a/a 

2 


“) (fix) 


r(*jP) 


*• T' J— rijislu.. -:-.-.- 

*' a Hrffifi) r( I fi l + k+i ) 
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which is the definition of E \cl,, (L~F 2 ) j , We observe 

that (11) exists for all values of k and h for which 

and > q Placing h*-k 

and pointing out that ~a^A„ * » we have as the k-th 

moment of 1-F Z , 


MjjU-rt]= e\i-f 2 ) *]= 

r(^) r(*¥) l *° i!r ( t T +k+i ) 


By using the relation 

_ _j_ _ r' OEp+k-i 




e 


dd 


we can write (12) in the form 


r ri-pv^ 

e\[i-f) ]= 


<13) 




** r' 

£ 

i-O /_ 


e zl d-9) 


z$n+t<-i ozl+i-x x /dzJ ) 

« £±J— ) dd 


777{*7+ L 


) 


The series in (13) is uniformly convergent in Q for OiQ£1 
and therefore, we can interchange the order of summation and 
integration and write 

(14) £ [fl-FV *] k 4(e)d&, 
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where 


(IS) 


</>(&)- 


S. S. WILKS 

g-p*) ZSkfl e *&- J 
r(X=±)r(i^) 


&D 

xL 

1=0 


P ZI e t r z ( 




Thus, we have a distribution function of a variable & such 
that the * //^moment of & is identical with the k-th moment of 
P fot all positive values of k . It follows from SteklofFs 5 
theory of closure that 4 >(&)must be the only continuous solution 
of (14), where /rj/7~pV*] is defined as (12). Therefore, 
the distribution of P is identical with that of 0 and can be 
written finally as 


r(¥) 


df= 


(16) 

N~J N-n 


r^Jrm 

-Wv ‘fyp-Wfi'eyierl. 


which is the distribution found by Fisher except that he uses the 
notation ??, - 77-i, the number of independent variates, and 
n f +n 2 * /- N 7 the sample number. 


*\\ f . Stekloff: Quelques applications nouvelles de la theorie de ferme- 
ture au pi obleme de representation approchec des functiones et au probleme 
des moments, Memoire de 1* Academic Imperial des Sciences de St. Peters¬ 
burg, vcl. 32, no. 4, (1914). 

xf. jf. 




CURVE APPROXIMATION BY MEANS OF 
FUNCTIONS ANALOGOUS TO THE 
HERMITE POLYNOMIALS. 


By Herrick E. H. GRirvuuF 
I. Introduction 

In an article by J. P. Gram entitled “Ueber die Entwickelung 
reeler Functionem in Reihen mittelst der Methode der kleinsten 
Quadrate 111 a unique procedure is set forth which leads to a very 
great simplification in the usual method of curve fitting by the 
method of least squares. That this method has not been given 
more consideration is probably due to lack of knowledge of its 
existence, rather than to lack of appreciation of its merit. Edward 
Condon, 2 Raymond T. Birge and John D. Shea 3 developed for¬ 
mulas by means of which curves can be fitted to certain types of 
data. Later, Harold T. Davis and Voris V. Latshaw 4 developed 
specific formulas, with tables of coefficients, by means of which 
curves of the second to the seventh degree can be fitted to the 
data with a minimum amount of computation. In a later paper, 
Professor Davis 5 has employed Gram’s method and in this way 
has developed a set o i functions analagous to the Legendre poly¬ 
nomials. 

The purposes of the present paper are: 

(1) To develop formulas for fitting curves of the second 
to the sixth degree to given data by the method of least squares 
where the n+1 frequencies of the data have the terms of the 
expansion of (j + ~ ) ^as weighting factors. 


journal fur Mathematik, Vol. 94, 1894, pp. 41-73, especially pp. 42-46. 
2 “The Rapid Fitting of a Certain Class of Empirical Formulae by the 
Method of Least Squares.” Univ. of California Pub. in Math. Vol. 2. 
No. 4, pp. 55-66, March 1927. 

8 “A Rapid Method for Calculating the Least Squares Solution of a 
Polynomial of any Degree.” University of California Publications in 
Mathematics. Vol. 2, No. 5, pp. 67-118, March 1927. 

4 “Formulas for the Fitting of Poljnomials to Data by the method of 
Least Squares.” Annals of Mathematics, Second Series, Vol. 31, No. L 
Jan. 1930, pp. 52-78. 
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(2) To develop by Gram’s method a set of functions 
analogous to the Hermite polynomials, by means of which curves 
of the second to the eighth degree can be fitted to data under the 
same conditions as in (1). 

(3) To study the properties of these functions, finding a 
generating function, a recurrence formula, a second order differ¬ 
ence equation, and giving other methods for deriving them. 

(4) To apply the functions in finding a curve to fit given 

data. 

(5) To furnish tables to facilitate rapid calculation of the 
coefficients of the required equation. 

II. Development By Ordinary Method of Least Squares 

Suppose we have given data in which the variates, x , are 
equally spaced, the observations being weighted with the binomial 
coefficients, having the origin at the mean. Thus, let the given 
data be 


X 

-i 

x -p x -p+l x -p+Z 

X-! x t • 

x p 

y* 

y-p y^p+i y-p+z ' 

y-/ 36 y, 

yp 

"St 

C-p C-p+l C-p+2 

C-, c 0 c t 

Cp 


where is constant, 

, a o)i !*»*!»* 

an * Cp-f-x) 1 (p~x)f2 2 

Since the x differences are constant, it is possible, without 
the loss of generality, to replace the x 5 by their corresponding 
subscripts. It is evident that the problem as set forth deals with 

““Polynomial Approximation by the Method of Least Squares.” 
Annals of Mathematics. 
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data having an odd number of classmarks. If there should be 
an even number, the data can be modified by leaving out one of 
the end variates, or by adding an item by extrapolation. It is 
also possible to transform these functions by moving the origin 
to the extreme left, thus having x vary from o to n . If this 
is done, the functions in part 3 of this paper will reduce to those 
generated by equation (18) in Gram’s article. 

It is required to find the coefficients <2/ ^ in the equation 


(I ) y -y'C x ~[ a n,o-> a nJ x + a n,2* Z + . a n7?*”] C x 

such that 

5 C X y x- a n,o- a n,l x - a r,,2** ' 


shall be a minimum. 

In this section, the ordinary method of least squares is used, 
leading to the n+1 equations 

M o * m o + a r7,2 *”2 + a n,4 . 

M > = a nJ m 2 + a n, J 171 4 * a n, f ^6+ . 

( 2 ) 

~ Q r) Q TTr)^ ■+ a n ^ Tr ' J 4 + &- n 4 rn ( > . . 


M, 


3 = <2 


'ft, 1 "% 


m„ +a 


nj m 6 + a r , J S' 




(3) M=£ y and m =£ 6 

x=-p x*-p * 


(It is evident from the symmetry of the distribution that all 
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moments, m r , with r odd will be identically zero.) 

These n+1 equations can be solved more readily by dividing 
them into two sets, one containing the coefficients of subscripts 
a n,Zr > t ^ ie ot * ier set containing the subscripts 2r+ i • 
Thus we get 

~ G-rj, 0 77 7 0 ^ G-yj g mg + Q n 4 + . 

(4) M z •a„' 0 mz+* rit2 T» 4 + a„ tA n? t + . 

= a r, t o 777 ** a n / Z 777 6 + a n, 4 m 8 + ‘ 

and 

M l = a n, 1 m Z + a r,, 3 m 4 * S m 6 * . 

(4a) M 3 = a n> 1 rng, , 5 m 6 * a n , ^ m 8 * . 


M S = ^ l 777 6 +a r,,3 7r7 e + a n,5 rn 10 + . 

The computation of the moments, m r , may be accomplished 
in the following manner: 

... ^ (2p)l 1 . + , 

(5) C *~ (p+X)!(p. X )l 2 zp 1 he 

general term in the expansion of ( z ->-) ) Zj ? 

_ (Zp)/ 1 )p+x)!Cp-x)! p-x 

C* = (p+x+l)i(p-»-i)l Z Zp ' (Zp)/ p+x+Z 


( 6 ) 




Multiplying each side of (6) by (x+l) and summing from 
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x--p to x = jo , we get 

£ C x+ 1 (p+X+l)U*l) k -£ (p-x)(x+l) k C x or 

**yO ~ P 

P k P k+1 p k P k 

Z2 C x+i (p)(x+l) + Z C x ^ (x+1) = Z p(x+l) C^-ILxfxpDC^ 


-p\£ c * * 


k-Z 


‘$[ C X* ^ J +(l) C x* k + ( V C x X *' J * • • 

By virtue of (3), this becomes 

p m *+™ k+i = tfc 7 * ^(ihfci + (V 


m k.Z + 


(7) 





Combining the terms in m L , and recalling that all odd 
moments are identically zero, we reduce equation (7) to 


( 8 ) 


Z 777 


k+1 





By means of this recurrence relationship, the moments are 
found to be 


*r J 


m zz 


3p 

m 4 m ~Z 


a) 


p 

T 


ftp 

S- 


/5p M . p 

4 2 
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(*) 


, mp* > ; 62 j/*\ p 

+ 44 + T~ 


TT1 

10 


m lz 


2205 P (3) , 255p {2) P 
= 32 e a 4 ~ 

. W95p<* > ; 51975p fS ) 65855 p" > ; ^ <» 


64 


32 


!6 


lQ2?p ( *> p 
4 + Z 

135135p (7) 945945 P M 945945p (f> _ 

m "iJT~ ' —& - '—75 — ' 7/ ^ 

197195p fs ) 4095 p™ . p 
8 4 Z 


where p rn) = pfp-lJ(p-Z) • ■ • -(p-r?+l). 


When expanded, the values are 
m 0 *l 

p 

m z‘l 


P 73P-D 
4 


m,-. 


'V 


p7I5p z - 15p + 4) 

8 

p7105p 7 Z10p z +147p -34) 

16 

p7945p4 3150p 3 + 4095p z - 2370jP + 496) 

32 


p71Q593p s - 31975 p 4 +W7415p *- 111705 p* 
64 


. 56953p-11056) 
+ 64 
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_ pfl?5Wp 6 -945945p *+ Z8378?5p+- 4379575p % 

t* m jza 

, 4114110p 2 -1911000p + 349504) 

* /ZQ 

Knowing the moments, it is now possible, by explicit cal¬ 
culation, to find the coefficients, a n L , in (1) for special cases. 
Case I. Linear 


y=(a Io + a i t i x ) 


(4) and (4a) both reduce to 

single equations 

"o- 

o ^ o * 

/Vj - rr> z a 1;L , 

(9) (a) 

m o 

(b) 

V 

c2j j = 4j 74'j , 



where A 

Case II. 

Quadratic. 


y * * a Z,l * + a z,Z* Z) C x • 


This leads to the two sets of equations, 


M o = * m z a iz> 

(a) 

M Z = * ^A a Z,Z> 

and 


(b) M 1 * m z a 21 . 

(Notice that one set of equations in each case is identical 
with a set in the preceding case. Therefore, only one of the two 
sets needs to be solved.) 
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Solving (a), we have 


( 10 ) 


a 2,O m A Z M o + S Z M 2> 
a 2,Z xB 2 M o +C Z M Z> 

where A z - > B Z 


Zp-i 


Zp-1 




pap-i) 

Case III. Cubic. 


The resulting equations are 


(a) 


M o-^ 0 ^,o +nrJ z a 3,Z> 


M Z~~ zn z a 3 i O+ rn 4 a 3 l Z> 


(b) /V7,= m **3.Z + *”4*3,3 > 
M 3 = m 4 a 31 + 777 6 a 3 3) 


x?) Cx 
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( 11 ) 


°3,o m a Z,o = A Z M o. + B Z M Z> 

* *2,2 - B Z M 0 + C z M Z> 
a 3 t l * A 3 ^1 + B 3, ^3 > 

where A3 .m^i£*±>, 

„ -4(3P-1 ). 


C 


3 


s 

<d 3 = 3 p(p-l )(2 p- 1 ). 


Case IV. Quartic. 

V= ( a 4 t O +a 4,l* +a 4,Z* Z+ %3* 3+a 4.S )C * 

The coefficients with the second subscript odd, are identical 
with those in case III. The other coefficients are found from the 
equations 


M <r m o 

Mg* m z 0 4 777^ £ 

M f m 4 *4,0 + *”6*4.2 


*4,1 

' a 3,l > 

<2 

* cz 

^5 

3,3 


+ m 4 a^ 4 , 
^ ^4.4 * 

* rr> 0 a 4.4 * 


and 


a A,o * A 4 M o 
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where 


a 4,Z + + E + M A- 


a 4.,4 " C 4 M o + E 4 M 2 * F 4 M 4 


a . (JlPlilP + tj , 
4 Zd' 4 

» - 10 (p-l ) 

3 *~ < ' 

c 4 * -4 ■ 

4 <2& 


d^( 2 p-JJ( 2 p- 3 ). 


n_ 4(24 P 2 - 39 p+ 17 ) 

V 4 _ 

- 8 ( 3 p- 2 ) 


d-- 3 p [p -l)( 2 p-lX 2 p-t 


Case V. Quintic. 


* m(a S.o~ “&*+*&* + a sp* +a S,A* + a sp*K 
As in the previous case, we have at once 


~ a 4 t o' 
a S, 2 S a 4 t 2 > 
a 5,4 s a 4,4‘ 
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and 

M r ™z a v + m 4 a n + m 6 a sp * 

™ 4 a sj * ' '” 9 ^ 5 -' 

A ^-“ rr> 6 a s i i *■ ^a 42 ^? ' 77 /<7 

from which 


+ B s M ,+ C , M s> 
(13) ^ 5 . B S M, +J? S M, 4 E S M S , 

a. ss . ^C f M t 4 E^/% *F" S M S , 


where 


V 


3 S 


XL 


P’s 


(? 25 p*~ ZIP Op 3 4 29J5p z -1480p 4276 ) 

cif 

~(20)(21 p 3 -6Jp z t y6p-!2) 


^(12£^zS2£j±. £/ , 

d-s 

AQ02p*-Z7p+l6 ) 

-QO(p-l) 

' —ir— » 

% 

/6 * 


» TT 


d s = /^pCp-l)(p-2X2p-l)Up-3). 

4.1 » ^ 

y- ^4** a P,l l£ * a 6,z x ** a 6 t y x7 ' t ' a d J 4‘ c4 * a 6,S JC5+ %6* ^x. 


As before, we have 

a <5,i * ' 

a *? ma S,3' 
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From the equations 


A£= 

rn o a 6 t o + ” 7 Z 


+ * 7 *°<v 


m z a 6 t o +m * 



11 

w Ao + w d 


+ 777 d?4 

M 6= 




we obtain 




a 6 J o / ^(> r ^o + ^6^z + ^6 y %* 

(14) *6,£ S 6 M o + E 6 M Z+ P 6 M *+ G * M 6> 
° 6i J C 6 M C+ E b M Z + &6 M 4 + 1 b M 6‘ 

“6,6- D 6 M o + G 6^Z + h M 4 * J b M 6, 
where 

, 3 (T)P!40p z + 147 /d - JO) 

4 6= __ 

« T(lJp z -4Qp+Za) 

B . 6 - TP -' 


^ !4(Jp-J) 

' 

A* 


'*■' 4 ' 


4 '= 7(Zp-l)(2p-j)(2p- 5). 

E . 2(J26jp 4 -l8Z7QpKjJ9l2p z -4J9?Op - /x/O 
6 <4 

- 20(171 p*- 68Ip 2 -, 64b P - 3^4 
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n 8(45p z -105p + 46 ) 
6 ~ Ct 6 

40(51 P Z - 147P + HO ). 


t 6Q(3p-4) 

l *~ dO b 


4- 


25L 

d 6 


a 6 = 45p(p-l)(p-4)(2 p-l)(Zp-5)(Zp~5)- 


Tables for all the coefficients, A /f A z , 3 Z - • • to , 
to ten significant figures for p from 1 to 20 will be found at 
the end of this paper. 

Special attention is directed to the last coefficient in each case, 
a r r , as reference will be made to it later. It is desirable to 
be able to compute this coefficient without having to solve a set 
of equations. 

Let represent the determinant 


m o 

m 2 

m 4- . . 

■ ■ m n 


™4 

m 6 

* ' 771 774- 2 


m 6 

777 3 

■ ™n+4 

m n 

f) 7_ , o 7 77 - * 

77+2 7?+& 

' m Z7J 
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integar, and 




Tn o m Q ' ' ' 

rn n+3 

m e * • • ' 

' m r, + 5 


I 777 ^1 * /r7 g r? I 

for 77 odd. P>y definition 

p (-2) - • 

Lot denote the *anie determinants, the last column 

being replaced by M Q t .A/ or 

M 1 1 M 3i M 5 - • • M n according to whether rj is even or odd. 
Similarly, r*x) will represent the original determinants, the 

last column being replaced by 1 J f. 77 or by 

X, x. j x , .^ tor ;7 even or odd respectively. 


It is clear from the normal equations in case r , that 


_ Pfr, M)_ 


™ 0 ™ 2 • ■ 

• • 


^4 ‘ • 

* 

■ 

m r m r+Z • 

• * * 

■ M r 


m o 

™z • * • 

• - m r 

m z 

’ ’ • 

■ m r+Z 

m r 

"W* * ‘ 

■ ' m Zr 


for n even. 
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m z 


Thus, for r=4 

m z 

1 

m 4 

M z 

we have, d =■ - 




vv ' i 

m o 

™z 



m z 


m 6 



W 6 

m e> 
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6/4/2! (2jo) f6J (Zp) (4} a P ) 

g 2 fb + 4 + Z ) 


Similarly, since p^j « -^§- 


P f3) 


J! 1 / (2p) ^ } f2 p) 


2Zf3-n) 

It is clear that for n even, or odd, 


/m\ -r> n«T7-£)!(»-4)! . ( 2 p)' (Zp) 

1 iy ) ~ — : ~ 




2{rr-t T7-2 -n-4- .) 


each scries ending with n- n= O or 1, for n even, or odd. 

With the recurrence formula (17) or the general formula 
(19) it is a simple matter to evaluate "Pfn) a list of which 
follows: 


„ _ P(2p-1) 

p ur — 

3 p z (p-I)(2p-l) 

ID - --—-—- i 

Vs) /6 

„ 3p z {p-l)(2p-lf(2p-3) 

P U) - 32 J 
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^ 43 P 3 7p-l) Z 7p-2)(2p-l) Z {2p-7) 

^ - Ifc - 

r, 13?p 3 7p- 0 *7p-Z)(2p-l) 3 (Z p-3) Z (J?p-5) 

- !o& - 

„ 14J75p+rp-J) J rp-Zj%-3)f2p-j/fZp-3) Z (2p-5) 

**»' ,<004 

t> * 2 > 232£. *(p-v’(p- 2) z (p-})tep-i) * 

32 , 76 a 

72p-?) 3(2p-5) z (2p- 7) 

72,768 


’ III. Development Hy Gram's Method. 

Let the variates. X t the observations, . and the weights, 
C# , he given as before. \\\ assume that there exists a set of 
functions 

7*), 4,7*), <4 7x) • • - • <fl n 7x) 

of degrees 0. 1, 2,. 77 , respectively, each of the form 


( 21 ) $ r fx)=b 0 -b b t x + b z x z + ■ 


6>r* r > 


and such that 


'(a)L C x <f) m fx) <p n 7x) = O for m* 


~P 


)1 


(22) 


& r -I 

(6)22 C (j> r (x) 

\-P L J 


S r+<>. 
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the* value of 5^ to be determined later. 

We wish to approximate by means of these functions a 
function of % , 

1 23) y = y'^ « [a 0 <p 0 (z)+a l ((> l fx)4a z ^ z (x)+ • - ■ •+aJ n (x)\C x . 

\.hidi will he the equation of a curve fitting the given data. 

Multiplying each side of (23) by (f) r (x) and summing 
from x--p to ±p —applying (22),—we have 

{24) 72 y x (j) r M - o. r 72 = a r ^ r • 

~ P m P 

Substitute in the left member of (24) the value of (fr r (x) 
in (21) and we have 

P P P P 

bJE y^+ b.E y x + bE y^x z + - - • +b„E y x =a r ^ r , 

X -p ^ 2 -p ^ r -P ** r r 


which, by (3), becomes 


, n „ b o M o + b , M , * b z M z + . +b r M r 

( 2 S) a r = - 3 ^ 


This value of a r is identical with< 2 rr found by the first 
method as may be shown in the following manner: 


Let 


f26) J-Zc3y^a 0 4> 0 (*)-a,4,{x) - ojjx)} , 


which L to be minimizedpvhete ’ 
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Taking the partial derivative of with respect to a r , we have 

<4 m- o,4, <*> .- ^ 

which reduces, by (22), to 



or 



as in (24) 


Jo * 

We are, therefore, able to write the values of — in (?5) 

o r 

by comparing the coefficients of the moments, Af/, in (25) with 
those ina r r in equations (9) to (14), or in (16). 

Thus, for rs 4, we have 






"* 5, ' ^ 


(28) 

4 ' = #-< 2 . 

4-C4, J,, 4 = 4^ 

o ' z 44.* 4. 4 4. > 


(29) /. ^ ft).S.C, * x+. 
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can be 


Now the value of 5^ 22 C x k«]' 
found by comparing the coefficients of C x x r M r in the expansions 
given by the two different methods. In c^se IV, we have 

V c 


y 


= r 


+ °-4 4 X * 


E A M z x.\cj y l o x. 4 +X.trms of lower degree in *) C x * 


so the desired coefficient is 
By (23), we have 


y« <f >4 (*)C X + a j (fij te)C x + . 


* ^ M • 5. F x*C + 
O^, 4 4- 4 X 


But by (28) b 4 => S 4 F 4 and the desired coefficient is 


S.F z 

-^-■s 4 f 4 =s 4 f 4 . 


Equating these coefficients, we have 


W ' 


(30) .-.3,4- 

1 <4 

Therefore, 5 r is equal to the reciprocal of the coefficient 
oi M r in & r r . But examination of (16) shows that this 
coefficient is ^(r~2) 
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(31) 




p (r) 


r P(r-Z) 

The coefficient of a; l in & r (x) can now be found by 

^(r) _ c 


Cn-z) 


multiplying the coefficient of M t in a r r by -p 

Indeed, it is possible to express 4> r (x) as the quotient of two 
determinants The coefficient of x l in p (r,x) will be identical 
with that of M L in P( r ^ yj)* We may, therefore, write 



Proceeding in this manner, we obtain 

0 2 (x)*x z -£, 


<t> 4 M= x 4 - Op-2)x*+ 

( 33 ) 4>J*h x s - 5(p-l)x?+ 0?P Z -z5p+6)x , 


4> b (x)--x b - 5 (^y+^'^ 46 s- 
6 (x) *x 7 - 7^**+ 1£?'>*J1*P+ J96 X * 

7 x 4 

(tQ5p 3 - 420P p - 90 \ x 
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!4(p-2U 6 + Z<l £ £i-f e + +t> x* 

10fp 3 -525p z + 74Zp-264- vZ 
- 2- * 

+ *$ p(p-V(p-*Xp-i). 

'l'he coefficient, ai , of ^ fa) is given below. In addition 
to the values obtained from (9) to (14), a 7 and a & have been 

added. The values of A,, B z , . J b are given in 

the tables at the end of this paper. 

a 0 =M o , 






;m) 


<4 • WVU- 




< 4 - *h M ** J e M o 

^.D y M, t G 7 M 3 *I 7 M r J r M T , 


V WS'VS 


/ V- D . / VW 
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d q JJ^1e1z^I£p1i±±Lbz22} 

"/ = ~ cLy 

_ !6(10?p 2 - 5!5p * *9&) 

6 7~ - d, - ’ 



2Z4('?p-5) 
. ■■■ -■■ > 


d 7 = 5>5p fp-0 (p-2)(p-D frp- l)(Zp-?)(Zp-5), 

A -£-- 

3 3 (2p-l)(Zp-3) (Zp-5 )(2 p- 7) 

„ - 16005p 3 - 525p 2 + 742P -264), 

s ° -^ 

_ U205p z -55p + 44) 

C 8~ d8 


n - -44Q7P-Z) 

±J ~ -- > 

8 d & 


It was 


£ - — 
E s~ d 8 ’ 


4 = ?l?p(p-lXp-2)(p-3)(Zp-l)(2p-3)(2p-5)(Z P j) 
shown above how the coefficients of x. * in <p r fe) 


could be found from those of M; in a„ . It is evident from 

L pp 

that, that with <fi r (x) known, the value of a r can be immediately 


determined by changing x L to and multiplying the result by 
F r = ^ I*® shown later that these $ ‘s can 

be determined independent of the determinants previously used, 
and more easily. o 7 and a 3 were determined from (fi 7 (x) and. 
$ Q fe)> respectively, and then checked by means of (16 ). 
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IV. Properties of the (j) Function. 

The similarity between the 0 functions just derived and the 
Hermite polynomials, of which these may be said to be the analog, 
is evident, and leads one to expect that there must be a generating 
function analogous e~ * by means of which all of the 0's 
can be found. ‘ Also, one would naturally expect to find a recur¬ 
rence formula and a second order difference equation analogous 
to the relationships existing between the Hermite polynomials. 
This proves to be true. We have, in fact, 

(35) C 3c^n Mlz (~l) A™ c x ( P + *> (n) > where 

(p+x)*' n ^~ (p+x)(p+x-l)(p+x>-2)- ■ • ■ (p+x-n+ 1). 

Expand the right member of (35) and divide both sides by 
C^ , and we obtain 

■b(^){p-xf r> ' z lp*x/ z) ■ ■+(-!)” *(p-*Xp+x) lrh +(-0 (p+x) j. 

A study of the expression in the bracket brings out the fol¬ 
lowing facts; 

(37) The coefficient of x 77 is (-Z) 11 , for it is seen to be 
equal to 

(-11 n -f"Jr-jJ”- 1 0)+(P)<-0w -(”)<-» oA■ • • 

(-D"af-n )• (zy-o n r. 

(38). If 77 is even, only even powers of X appear in the 
expanded form of the bracket; for r> odd, only odd powers of 
X will occur. 
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Consider the coefficients of x 71 ' 1 > x n ~^ t x 77 "^, etc. With 
r? even, the terms of odd degree in % in (will be nega¬ 
tive, and cancel the corresponding terms in the last parenthesis, 
(ptx)^ n ^ . If r? is odd, the factor (-1)* 7 causes the corre¬ 
sponding terms of even degree in the same two expansions to have 
opposite signs, and, therefore, to vanish. Similar reasoning holds 
for every pair of products in the bracket which have the equal 
coefficients, If rj is odd, this will include every 

term. If n is even, the middle term will be 

(p-x) (p-f-x) = (p-x)(p+x)(p~x - l)(p+x-jXp -x-2)(p+x-2 \,, 

the odd powers of,*; having zero coefficients. 

Therefore, either all terms are of even, or all are of odd 
degree. 

It is necessary first to prove that the relations (22) hold; 
that is, 


jd 

EC x 6 m M4 n M*Q t for >77/n. 


T7J J? X 771 


fm) 


To prove the first relationship, we may proceed aj 3 follows: 
Let 

5 ^ 4 m M 4„ (*)-L M Cj n (z) 

(39) e+1 

C X 4„M]~ , 

where we may assume without loss of generality that n > m, 
for if it is not, i j^/^can be interchanged. Using 
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the formula for finite integration, 


A* a ' 4 


we have 


+A%C*ki”c Mt 4„ M. . +&)”A XMA%J n M 

(40) 

/ 4 )”7+J 777//. , -7^/r 

+ { ' 1} A C x+m+l* n . 

Iictweeti the limits -p and p+1 . 

Now A^Cj^U)* A' 1 [C-j[) n A n C x (/>+x/ n} ], (by 3S) 



n n-1 

A C^p+z) 


(n) 


-nr T^hw^i! <^<^a 
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For the lower limit, -p , the factor p + x * O , and for 
, * 1 1 n 

.\A*Cj U) *+*-0 

X 77 ~P 

Similarly, each of the succeeding terms up to and including 

the term A^ T> l> rTf (x)A m 1 ^- x ^ rn 4^ (x+m) becomes 

identically zero because of the factor p+x or C x for the lower 
and upper limits, respectively. 

Since is of the m-th degree in X , A (*)and 

all higher differences are identically zero. 

(41) :.Z^C x <}) m (x)(f> r} (x)=0, for 

If 777 »J7 , the first 777 terms vanish as in the preceding 
case. The (m+l)th term is the only term left in the series. 

< 42 > 

“ m>r, ' A m * C x4Tr, (* +rn) - 

” 1 77 -/ / p 777-1 / / / \ m . / 777 J 1 

A {(-jjA C^Jpwm) J 



(Z p)! f p+x+m) 

Z (p+z 4 m)!(p-X-m)! 


But 
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(43) 


(~t) m (2p)l -/ i 

(p+x)!(p-x-m)i 

It is now necessary to find a function, u x , such that 


Au 


x 


Au « i _ , 

36 (P+* )l (p-x-m)l 

Since As(£-J) , we may write 

« (E-l) - « -- i -- 

* (p+x)i(p-x-m)! 


U **IrE '* (p+x)!(p-x-m)! 

• ~\l+E + E**E V...1 -*- - -- 

L J (P+x)!(p-x-rrt)! 

. t ± /W P*-l 

‘' ^ (p+x)l(p- 3 t- m)! = 


— — A -«• 

(p+x)!(p-x-n))! (p+x+lj! (p-x-m-1) 1 (p-tx-t2)>(p-x-m-2) 

+ .* > 

between the limits -p and p4-l . Substitution of the 
upper limit, p+1 , makes every term zero, because (p-x -7rj-A)!.oo. 


For x~-p , the right member becomes 

y __i__ 

0!(2p-m)! l!Up-m-l)! 21 (2p-m-2)! 

+ (2p^rn^T)l \.Zp-rr}-(2p-m-l)']! + (2p-m)!0!' 
all succeeding terms being zero because of the second factor. 


(p+X) ! (p-X-m)! 
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CUKVH APPROXIMATION 


_ _i__ 

(Zp- m)! 



or 


Zp- m 

< 45 > * 


Returning to (43) and then to (42), we have 




™ , 
fn. 


(-l) m (Zp)! 

2 zp+m ' (Zp-m)l 


ml (2p) 
Z 2m 


{77?) 

- = 5^, by (22), 



m' 


P (m) 

p (m-Z) 


W W Pfin-tJ ■ See (!7)] 

Therefore, C J J y T) ( x )=(-j[) n A n Cx (p+x) (nJ satisfies both 

conditions of (22), 

Recurrence Formula. 

It is necessary to note that 
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.,, up)! r (n> ~\ 

& CJp+xJ « g Zp 'i)t(p-i-l)/ (p+jc)!(p-x)^ 

(48) vC^CjD**/” ‘ (-2±*-n-J). 


.wA, . Jn+Z) A n 

A C (p+x.) = A 




= A” Jc^, (p+xf n} (~2*. * n jj 

- (-Zx+nJA C x (p+x) + o{-2)A C^j(p+z+t) 

I*ut C^ + j (p+z+ly n) * fj +A)C } ,(p-t-x) ^ 77 ^. 

-Z >iA C JL (ptx/ n -2nA ‘ C^(pt-x/^ 

- - ZxA n C' h (psx) m - n A n 1 1 ~AC x (ps*:/”^ -20 »x) 
= -2x. A’'C^tp+x) f ”-r)A ” J C ) . (pHxf” / \-2x.+ n-1) 

- vA n / C (p^2)^ r> '^(2p+2z 2n+Z) 

Xf 

*-Zx,A n C^(p+x/ 71 ~n(Zp+l-r))A C z (p+x) * \ 

\ & {0+ zj <T * ('Z) 77 (*)> 
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1 n ~ ! 

: (-Z) cj> m ,M-- 2x(-z) n cJ„<z)-TtOWnWcj^,ex), 


or 

( 49 ) 4-tp n ^ / (x)-4z(P n fc)+ n {2p+/- ”)&„./ M « O. 


Difference Equation . 

To simplify the reductions later in this development, it is 
desirable to have the following identities:— 


( a ) 





4xf±ax+^zp_ 

(p+x+ZX,p-rn + l) 


C 50 > 

( c ^ C x+X ^X p + x +1 ' 

... . - _ --?•*-? - P-x -Zx-3^ 

(d) A - C x+/ p+ x +2 “ p-fx-t/ ' p+x+z’ 

< e ) - r -f p -* )(p ; x - 1 } - - 

x+Z x (p+x +2 )(p*x+J) 

Let « C x (p+x) t77 \ 

A\.A n C x ep+*J'”'- (-Z) n C x 4> n fc), 

A u x = ^ l'/ 5 ^ <!7 ~ J ^-Zx+ n-1), (by 48) 

and , multiplying both sides by ( p + x - n + 1 ) , we get 

( p + x .- n + L ) Au x = (~ Zx +- n - 1 ) u x 
Difference this equation t ?+1 times , having 

^ , ,, n+2 s , r>+/ 

CP+<*~ 77 + u z + f ; r?+l)A ( u x +/) 

n-f- / 

=-(- 2 x + n-l )A u ^- Zfn ^ l ) A n u.^ t 


( 51 ) 
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or, since a x+j = (L+A)-~ u x , we have 
f52) (p+x-n+l )A u x ■* (rt+i)A (n+DA n *' 

~(-2x-ni~l)A r ’* 1 u^-Z(n+l)A n+ u x -Z(n+l)A n u x . 
Combining like differences, we get 

(p+x+Z) A u 4- (2x4-2 r?+4)A u+ Z(ri+l)/\ u = O. 

X XX 

(S3) (p^+2)A Z (-Z) n Cj r) fa)+{2x+2rt+4)A(-Z)\<p n fa) 


+ 2 {r?+l)(-Z)\<fi n fa)* 0 . 


This may be simplified by making the following reductions; 


4 .za^m-ac^Xm-c^ 

(p-x)(-2x-3) 

' (p+Xf 2 )(p+z+l) 


*^^•4 +ZAtj&c J^ m ZpL 
11 * (p+x+z)( p *x+i) ” *<^nrz7Z- 


+A%fa).r j£^fe^2— , 

17 x (p+X+Z)(p+X-t-lJ 


(J> n fa)‘C x p+x+j +A&P (xt'Cx'p+x+j 


sail 
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Substituting these values in (53), noticing that C % is a 
-common factor, and that — can be made a factor by 

multiplying the last term by ^~rri > we have 

(54) 

(p-x)(p-x-l}A Z 4 n U)*&(p-x)(-2x'3)*(p-x}(2**2n*4)£ 1(4 te> 

+x Z -i-8z:-+2-2p+f2x-t-2r?+-4)(-2x-I)i-Z(riD-txr^jjj 4> n (x)^ O . 

The coefficient of $ fe) reduces to Zn(p-x), and that of 
A$„(*), to (p-x)(2 n -Z-2x.) . Dividing by (p-x), 

we obtain the desired second order difference equation 

(55) (p-x-1)A z 0 n (x)+2(n-l-x)A <t> n (x)+Zv# n (*)-O. 


V. Other Methods of Deriving These Functions,— 
3rd. Method. 

Equations (2) were divided into two groups, (4) and (4a), 
from which these functions were developed. It would be expected 
that the functions could be derived from (2). This is easily seen 
to be true. 

Let ®(r>) be the determinant formed by the coefficients of 
a„ L in (2). 


m Q O rn z O 




(56) Let O 


■M 


O m 2 


m z 0 


O 0 ... . 

TT7^ O m fa . . . . 
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for 77 > <9 In the special case r>=-J, we will define ^ 

I m 0 O w z O . 1 


Let a 


O tt 7 z O 


m z O O 


Then <j) (z) is easily seen to be 


157) 4> n (x) 


Q(n.x) . 
Q ifn-J.) 


Explicitly, we will have 




m a 1 

O z 


m 

Q 

0 

1 

0 

m z 

X 

™2 

O 



O 


O 

m z 



2 • rn ? ,2 

- X - 

171 0 777 2 


z £ 

" Z 9 
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m c 

0 


1 


0 

™z 

0 

X 



0 

m 4 

x z 


0 

m 4 

0 

x^ 

. r 7. /^V 777 ^ 

m o 

0 

m z 



0 


0 



m z 

0 

m 4 




■ 3 o-l 


fn 


x, 


and so on. 

The coefficients of the 0‘s can be found from the formula 

( 58 ) 

*(1) 

where Q ( ^ is , the last column being replaced with M Q 
M n M Zi . M l . Thus, we have 
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Mo 

m. 


- M a , 


m o M o\ 


0 M, 

m 0 O 

0 "H 


L HkH .Im.-A'M,, 


771 o m 2 


m„ 


O At 


O m, M t 


w z 


O M, 


m. 


% ° m z 


O m, O 


W A 


-J]2g!?o + rrl o M 2 

m o m A- m 2 2 O ’V’ 77 / 


2 _ 

~2p~l 


M 0 + 


4 _ 

p(2p-l) 


M ,, 
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% 

0 


"o 


0 

m z 

O 

M, 


m K 

0 


M z 


0 


o 

3 

M'(m Q rr>*)+M 3 - m x ) Z 


o 


0 

m fo m z m*Mrr£(m 0 m*- rrg) 

o 

m z 

0 



m z 

0 

% 

0 


0 


0 

% 



1 fggVj 

l 6 p z (2p-l)(p-l) 


M +C M , 

/ 3 3 


This method becomes more difficult than the first because of 
the higher order determinants, and is, therefore, of less value in 
deriving the functions. 

A fourth method of developing these polynomials is to build 
up a set of orthogonal functions in the following manner : 1 

*This method is given in a thesis by Harry R. Mathias, entitled “Prop¬ 
erties of Orthogonal and Biorthogonal Functions from the Standpoint of 
Integral Equations/’ written at Indiana University August, 1925. He cites 
as his reference E. Goursat—“Recherches sur les equations integrates 
lineaires,” Ana de la Fac. De Toulouse, t. 10, 2nd series, 1908, pp. 5-98, 
especially page 66. 
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Assume a set of functions 

fjxhl, f z U)*x, z f 3 (x)= k 3 f 4 't*)-X' 4 ‘ 

It is required to find a set of functions, ^ (x>) , such that 


L C (p rrl U)<p r) (z)=0, for mt n. 
- P 


I-et 4> 0 (x)=f o (x.) = 1. 


We may then form the equations 

p r 1 

L C x f L M-C£ l £ M I f 0 U) =0 
-p L -1 


obtaining 


LC X 

-p * 


*-a L -1 


1 - 0 , 




(59) 




-P 




the solutions of which are 




P 


S7. a <0 » rrf — 


pOp-1) 



242 


CURVE APPROXIMATION 


Let & -a f 'l » % * <&£ (%), 


^ ^ , 


pf3p-l). 


e 4 M—4 

Form a set of equations, similar to (59), using the 0 S 
ZC x fz*-§-b,*)*.0. 

ZC x (x*- b 2 x)x.= 0 t 

£cJS-*£%2- h x)x.O. 

-P 

From these equations, we get 

b , * O\ b 2 • 4 =< 3 - 

Let ilr,(*)=(x 2 -§)-<J).(x.)> 


l£(z)m.(x*- 

„ , » y >*, Pfep~l)\ 

\JJjx)*(xT- -+-% -> 

Similarly, the equations 

s c,u‘- py**- §>-o, 

\u*-z\u>- §>-o. 
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lead to the values 

c,~o, C 2 =(Jp-Z). 

C f is easily seen to equal 0 , as the term independent of 
C t is 23 (C^ multiplied by an odd function in x . Expanding 
the second equation, we get 



and substituting the values of the moments, we reduce C z 
to the value above, C z = (5p-Z) . 

Let M, 

-Op-zx**- j j 

- S-PP-2U** 

This method may be continued to obtain as many of the tp ‘s 
as desired. It is to be noted that X. (x) is <fi . (x) . However, 
the proof of this would necessitate adding another function, 
Cx.) , at the beginning, and carrying out the process with 
another equation in each set. 



244 


CURVE APPROXIMATION 


Method 5. 

Gram's equation, number 18. is 

This gives /'s for the origin at the left end of the distribu¬ 
tion, the variates running from O to n . Let this equation be 
transformed as follows: 

Let m = n, 

x*x'+p , the primes being dropped, 

If the values of rm 0,1, 2, ■ • • be substituted, 

we will have $„(*>•(-$)”?„ (x) and the squired func- 

tions are found again. 

VL Example. 

As an example to illustrate the use of this development, I 
have chosen one used by Karl Pearson, 2 the data of which he 
attributes to T. N. Thiele, 3 and are the frequencies in a game of 
“patience/* On page 295, Vol. I, Pearson states, “Now either of 
the curves in Illustration I and II is a good example of the im¬ 
possibility of using the method of least squares for systematic 
curve fitting." (The set of data below is his illustration II.) 

In volume 2, using the Method of Moments, he fits curves of 

^ram uses the notation 77 777 instead of 77 ^, and (m) r 

2 On the Systematic Fitting of Curves to Observations and measure¬ 
ments. Biometrika, vol. 1, (1902), pp. 265-303; vol. (1903) pp/1-27. 

3 Thiele~-Forelaesninger over Almindelig Iagttagelseslaere, Copenhagen, 
^ (1889) p. 12. 



H. E. II. GREEN LEAP 


245 


the second to the sixth degree to this data. Noting that his state¬ 
ment. nage 18. “taking the sixth parabola as the*hest fit” is correct, 
it is found that the sum of the squares of the errors is more than 
1400. llis results for the skew frequency curve gives as the sum 
of the squares of the errors, 782, which indicates the second curve 
to be a more accurate fit. 

Let the given data be 










CURVE APPROXIMATION 

P 

The* moments, M ~ L y'X 1 

1 -p x 

. are computed and found to bo 

M =$00, 
o ' 

M - 146b, 

It 

l 

<3 

M = 2b, bb4 

M =208d, 

M 9 = b4, OIO, 

M= 607 J 68 

O 

From Table I, Cm p- 7 

, we find 

A, = 0.28571 42857. 

6? = (2)502(0 85(26. 

S> 2 = -015384 61538, 

E $ - - (2)10656 01066. 

C A = 0.043956 04396, 

F s = (4)35520 03552. 

B m =-0 048840 04884, 

D b - -.(2)10360 01036. 

C = (2)48840 04884, 

<4 = (3)99719 21083, 

C A » .(1)13986 01399, 

/ 6 --.(3)11182 23340. 

* - (2)84360 08436. 

J b * .(5126111 13742. 

2^ = .(3)44400 04440, 


Substituting in the abme equations and putting p= 7 in (x) 

we have 
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& 

fl 

It 

4> a (*) = 1, 

a t = -20 

8 t M = x>, 

a z *14.85714, 

fyx) -- 

a,* 10.07875, 

X 

rr\* 

X 

If 

'—s 

vii 

■Qu' 

af 1.21744, 

$ 4 (x) = x. 4 -/9x z +3L), 

v 0.59)84, 

4> } M •x*-?0x'*+i41?x t 

a b , 0.18056, 

4> b fie) = 42. 5x\579x 2 -593.7). 


Multiplying the corresponding values, and collecting like 
powers of x , wc have 

y«^y'= (411258)6 -69.80444x +60.1)802x z -1.29045x 3 

- 6.4)6)6x*+ O. )9164.* % 0.18056x8) C x . 

[I would call attention here to the fact, that if it is desired 
to have an equation of the second, third, fourth, or fifth degree, 
the above data can be used, without change, using only as many 
functions as the degree required. Thus for the fourth degree 
equation, we have 

(a 0 4> 0 M + a, M + a z 4> l M+a 3 8 } M)c x , 
y*(486.749)6 -!Z).787)0x -8.27422x z +1057875** 


+1.Z1744x 4 )CJ 
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CURVE APPROXIMATION 


Substituting values of x from -7 to +7 in the above equation, 
we obtain the following results- 


X 

y' 

c x 

(From Table 11 ) 

iBS 



Krior 

yy* 


.3385 714 

.(4)61035 156 

20 

0 

0 

0 

-6 

258.775 

.(3)85440 219 

>> 

0 

0 

0 

-5 

- 20817 

.(2)55541 992 

— 12 

0 


- 1 

-4 

421.1'JO 

.(1)22216 707 

9 36 

9 

7 

) 

— 3 | 

713454 

(1)610% 191 

43 50 

44 

35 

9 

-2 

701412 

012219 238 

85 71 

86 

l()i 

-15 

-1 

530.835 

0 18328 857 

08 94 

90 

89 

10 

0 

415254 

020947 266 

8698 

87 

94 

— 7 

1 

398.437 

018328 857 

73 02 

73 

70 

3 

2 

426.808 

0 12219 238 

5210 

52 

46 

i) 

3 

417.224 

.(1)61096 191 

25 49 

26 

30 

- 4 

4 

507.857 

.(1)22216 797 

1128 

11 

15 

-4 

5 

1431.276 

.(2)55541 992 

705 

8 

4 

4 

6 

5016640 

(3)85449 219 

4 28 

MM 

5 

- 1 

7 

14822.246 

.(4)61035 156 

90 

HI 

1 

(1 


The sum of the squares of the errors for this curve is 562. 
as compared with more than 1400 in Pearson's first method, and 
782 for his skew frequency curve. 

The fourth degree curve, found In this method, gives 1170 
for the sum of the squares of the errors. 

*Taken as 26 so that Z y * E y x - ?00. 
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TABLE I 


(Number in parenthesis indicates the number of ciphers between the 
_decimal point and the first significant figure.)_ 


p 


A z 

i 


P 

1 

2.00000 00000 

2.000(H) 00000 

- 2.00000 00000 

4.00000 00000 

1 

2 

1.00000 00000 

1.66666 66667 

- 0.66666 66667 

0.66666 66667 

2 

3 

0.66666 66667 

1.60000 00000 

- 0.40000 00000 

0.26666 66667 

3 

4 

0.50000 00000 

1.57142 85714 

- 0.28571 42857 

0.14285 71429 

4 

5 

0.40000 00000 

(.55555 55550 

- 0.22222 22222 

.(1)88888 88889 

5 

6 

0.33333 33333 

1.54545 45455 

- 0.18181 81818 

.(1)60606 06061 

6 

7 

0.28571 42857 

1.53846 15385 

- 0.15384 61538 

.(1)43956 04396 

7 

8 

0.25000 00000 

1.55333 33333 

- 013333 33333 

.(1)33333 33333 

8 

9 

0.22222 22222 

1.52941 17647 

- 0 11764 70588 

.(1)26143 79085 

9 

10 ] 

0.20000 00000 

1.52631 578)5 

- 0.10526 31579 

.(1)21052 63157 

10 

11 

0.18181 81818 

1.52380 95230 

-(1)95238 09524 

.(1)17316 01732 

11 

12 

0.16666 66667 

1.52173 91304 

-{1)86956 52174 

.(1)14492 75362 

12 

13 

0.15384 61538 

1.52000 00000 

-7(1)80000 00000 

.(1)12307 69231 

13 

14 

0.14285 71429 

1.51851 85185 

-(1)74074 07407 

.(1)10582 01058 

14 

15 

0.13333 33333 

1.51724 13793 

-*(1)68965 44828 

.(2)91954 02298 

15 

16 

0.12500 00000 

1.51612 90322 

-(1)64516 12903 

.(2)80645 16129 

16 

17 

0.11764 70588 

1.51515 15152 

t(1)60606 06061 ■ 

.(2)71301 24778 

17 

18 

0.11111 11111 

1.51428 57143 

t(1)57142 85714 

.(2)63492 06349 

18 

10 

0.10526 31579 

1.51351 35135 

-(1)54054 05405 

.(2)56899 00427 

19 

20 

1 0.10000 00000 

1.51282 05128 

7(1)51282 05128 

.(2)51282 05128 

20 


P 


B 3 

S 

p 

? 

3.77777 77778 

- l.iiiii urn 

0 44444 44444 

2 

3 

2.08888 88889 

- 0.35555 55556 

.(1)88888 88889 

3 

4 

1.46031 74602 

- 0.17460 31746 

.(1)31746 03175 

4 

5 

1.12592 59259 

- 0.10370 37037 

.(1)14814 81481 

5 

6 

0.91717 17172 

-,(1)6§686 86869 

.(2)80808 08081 

6 

7 

0.77411 47741 

-.(1)48840 04884 

.(2)48840 04884 

7 

8 

0.66984 12698 

-.(1)36507 93651 

.(2)31746 03175 

8 

9 

0.59041 39434 

-.(1)28322 44009 

.(2)21786 49238 

9 

10 

0.52787 52437 

-.(1)22612 08577 

.(2)15594 54190 

10 

11 

0.47734 48773 

-.(1)18470 41847 

.12) 11544 01154 

11 

12 

0.43566 09574 

-.(1)15371 10232 

.(3)87834 87044 

12 

13 

0.40068 37607 

-.(1)12991 45299 

.(3)68376 06838 

13 

14 

0.37091 30375 

-.(1)11124 67779 

.(3)54266 72093 

14 

15 

0.34526 54625 

-.(2)96332 78599 

.(3)43787 62999 

15 

16 

0.32293 90681 

-.(2)84229 39068 

.(3)35842 29391 

16 

17 

0.30332 73916 

-.(2)74272 13310 

.(3)29708 85324 

17 

18 

0.28596 32742 

-.(2)65981 94834 

.(3)24898 84843 

18 

19 

0.27048 10073 

-.(2)59006 37480 

.(3)21073 70528 

19 


0.25659 01934 

-.(2)53081 42149 

.(3)17993 70220 

20 
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CURVE APPROXIMATION 


p 

\ 


* » — a. . 

P 

2'""" 

2.66666 66667 

-3.33333 33333 

0.666M> ft6667 

2 

3 

2.20000 00000 

- 1.33333 33333 

0.13333 33333 

3 

4 

2.08571 42857 

- 0.85714 28571 

.(1)57142 85714 

4 

5 

2.03174 60317 

- 0.63492 63492 

.(1)31746 03175 

5 

6 

2.00000 00000 

- 0.50505 05051 

.(1)20202 02020 

6 

7 

1.97902 09790 

- 0.41958 04196 

.(1)13986 01399 

7 

8 

1.96410 25641 

- 0.35897 43590 

.(1)10256 41025 

8 

9 

1.95294 11765 

- 0.31372 54902 

.(2)78431 37255 

9 

10 

1.94427 24458 

- 0.27863 77709 

.(2)61919 50464 

10 

11 

1.93734 33584 

- 0.25062 65664 

.(2)50125 31328 

11 

12 

1.93167 70186 

- 0.22774 32712 

.(2)41407 84»749 

12 

13 

1.92695 65217 

- 0.20869 56522 

.(2)34782 60870 

13 

14 

1.92296 29630 

- 0.19259 25926 

.(2)29629 62963 

14 

15 

1.91954 02299 

- 0.17879 94891 

.(2)25542 78416 

IS 

16 

1.91657 39711 

- 0.16685 20578 

.(2)22246 94105 

16 

17 

1.91397 84946 

- 0.15640 27370 

.(2)19550 34213 

17 

18 

1.91168 83117 

- 0.14718 61472 

.(2)17316 01732 

18 

19 

1.90965 25096 

- 0.13899 61390 

.(2)15444 01544 

19 

20 

1.90783 09078 

- 0.13167 01317 

.(2)13860 01386 

20 


P 



5 

t’ 

2 

7.77777 77778 

- 1.77777 77778 

O.44444 

> 

3 

171851 85185 

- 0.20740 74074 

.(1)29629 62963 

3 

4 

0.77777 77778 

- .(1)63492 06349 

.(2)63492 06349 

4 

5 

0.44656 08466 

- .(1)27513 22751 

.(2)21164 02116 

5 


029046 01571 

-.(1)14365 88103 

.(3)89786 75645 

6 


020424 02042 

-.(2)84360 08436 

.(3)44400 04440 

7 


0.15152 62515 

-.(2)53724 05372 

.(3)24420 02442 

8 


0.11692 08424 

- .(2)36310 82062 

.(3)14524 32824 

9 


.(1)92970 98957 

- .(2)25685 12785 

.(4)91732 59947 

10 

n 

.(1)75704 41255 

- .(2)18834 96620 

.(4)60757 95549 

11 

12 

.(1)62843 75850 

- .(2)14220 88379 

.(4)41826 12878 

12 

13 

.(1)53006 31735 

- .(2)10999 62840 

.(4)29728 72538 

13 

14 

.(1)45312 71198 

- .(3)86826 75349 

.(4)21706 68837 

14 

15 

.(1)39181 82002 

-.(3)69735 85518 

.(4)16217 64074 

15 

16 

.(1)34217 03127 

-.(3)56853 29378 

.(4)12359 41169 

16 

17 

.(1)30140 11078 

-.(3)46959 15512 

.(5)95835 01045 

17 

18 

.(1)26751 17185 

-.(3)39234 54904 

.(5)75451 05584 

18 

19 

.(1)23903 60285 

-.(3)33115 82259 

.(5)60210 58653 

19 

20 

.(1)21487 88465 

-.(3)28206 34400 

.(5)48631 62758 

20 
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p 




P 

“7~ 

5.31555 55556 

- 2.31111 mil 

0.19555 55556 

3 

4 

3.15206 34920 

- 0.86984 12698 

.(1)46349 20635 

4 

s 

2.28176 36680 

- 0.46490 29982 

.(1)18059 96473 

5 

6 

1.79717 17172 

- 0.29090 90909 

.(2)88888 88889 

6 

7 

1.48530 58058 

- 0.19962 25996 

.(2)50260 85026 

7 

8 

1.26686 60968 

- 0.14562 47456 

.(2)31176 23118 

8 

9 

1.10499 84438 

- 0.11098 66169 

.(2)20666 04420 

9 

10 

0.98009 86125 

-.(1)87421 16730 

.(2)14402 01812 

10 

11 

0.88072 97705 

-.(1)70654 75135 

.(2)10436 86658 

11 

12 

0.79975 28076 

-.(1)58297 25830 

.(3)78047 55631 

12 

13 ' 

0.73247 36327 

-.(1)48925 37414 

.(3)59889 86859 

13 

14 

0.67567 56659 

-.(1)41647 89943 

.(3)46958 80252 

14 

IS 

0.62707 92809 

-.(1)35883 40140 

.(3)37500 17543 

15 

16 

0.58502 29532 

-.(1)31239 29587 

.(3)30421 80906 

16 

17 

0.54826 66002 

-.(1)27442 67469 

.(3)25019 32673 

17 

18 

0.51586 56594 

- .(1)24299 01253 

.(3)20824 49142 

18 

19 

0.48708 78486 

-.(1)21666 60247 

.(3)17517 73888 

19 

20 

0.46135 66062 

-.(1)19440 22295 

.(3)14875 87453 

20 


P 


E , 

F , 

P 

3 

1.27407 40741 

- 0.11851 85185 

.(1)11851 85185 

3 

4 

0.31746 03175 

-.(1)19047 61905 

.(2)12698 41270 

4 

5 

0.12768 95944 

-.(2)56437 38977 

.(3)28218 69488 

5 

6 

.(1)64197 53086 

—.(2)22446 68911 

.(4)89786 75645 

6 

7 

.(1)36852 03685 

-.(2)10656 01066 

.(4)35520 03552 

7 

8 

.(1)23117 62312 

-.(3)56980 05698 

.(4)16280 01628 

8 

9 

.(1)15458 03507 

-.(3)33198 46457 

.(5)82996 16142 

9 

10 

.(1)10847 37988 

-.(3)20639 83488 

.(5)45866 29974 

10 

11 

.(2)79052 85098 

-.(3)13501 76788 

.(5)27003 53577 

u 

12 

.(2)59393 10287 

-.(4)92017 48332 

.(5)16730 45152 

12 

13 

.(2)45755 21097 

-,(4)64862 67356 

.(5)10810 44559 

13 

14 

.(2)35996 92489 

-.(4)47031 15814 

.(6)72355 62791 

14 

15 

.(2)28829 97519 

-.(4)34930 30313 

.(6)49900 43304 

15 

16 

.(2)23447 56961 

-.(4)26484 45363 

.(6)35312 60483 

16 

17 

.(2)19326 72711 

-.(4)20444 80223 

.(6)25556 00279 

17 

18 

.(2)16118 23181 

-.(4)16033 34937 

.(6)18862 76396 

18 

19 

.(2)13582 79996 

-.(4)12750 47715 

.(6)14167 19683 

19 

20 

.(2)11552 71331 

-.(4)10266 67694 

.(6)10807 02835 

20 
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p 

A 6 

*6 

< "b 

P 

3 

31.86666 66667 

- 2.95555 55556 

1.24444 44444 

3 

4 

9.86349 20646 

- 1.68888 88889 

0.31111 11111 

4 

s 

6.24338 61423 

- 1.20740 74074 

0.14814 81481 

5 

6 

4.87864 28716 

- 0.94276 09428 

.(1)87542 08754 

6 

7 

4.17145 81715 

- 0.77415 17742 

.(1)58016 05802 

7 

8 

3.75275 83528 

- 0.65703 18570 

.(1)41336 44134 

8 

9 

3.47631 97577 

- 0.57083 96179 

.(1)30970 33685 

9 

10 

3.28104 57516 

- 0.50471 27623 

.(1)24079 80736 

10 

11 

3.13617 37676 

- 0.45235 63811 

.(1)19263 84589 

11 

12 

3.02462 67843 

- 0.40986 52428 

.(1)15764 04780 

12 

13 

2.93620 42788 

- 0.37468 59903 

.(1)13140 09662 

13 

14 

2.86445 51800 

- 0.34507 78315 

.(1)11121 84648 

14 

IS 

2.80510 85568 

- 0.31981 26863 

.(2)95359 72754 

15 

16 

2.75522 87178 

- 0.29799 91485 

.(2)82670 73153 

16 

17 

2.71273 35648 

- 0.27897 43935 

.(2)72358 73754 

17 

18 

2.67610 66890 

- 0.26223 52558 

.(2)63864 45096 

18 

19 

2.64421 82442 

- 0.24739 28474 

.(2)56784 05678 

19 

20 

2.61620 96162 

- 0.23414 18341 

.(2)50820 05082 

20 


P 

*6 

*6 

* 

P 

3 

- 0.88888 88889 

10.85234 56791 

- 3.56543 20985 

3 

4 

-.(1)12698 41270 

28.89171 07594 

- 5.51675 48502 

4 

5 

-,(2)42328 04233 

1.42790 35865 

- 0.19461 49324 

5 

6 

- .(2)19240 01924 

0.86407 78152 

-.(1)91881 78077 

6 

7 

-.(2)10360 01036 

0.58217 60133 

- ,(1)50816 67304 

7 

8 

- .(3)62160 06216 

0.41986 26065 

-.(1)31099 76443 

8 

9 

— .(3)40221 21669 

0.31750 86255 

- .(1)20431 95246 

9 

10 

-.(3)27519 77984 

0.24868 72810 

-.(1)14149 24384 

10 

11 

-.(3)19656 98560 

0.20013 83997 

-.(1)10205 88045 

11 

12 

-.(3)14529 07631 

0.16458 67560 

-.(2)76047 24004 

12 

13 

-.(3)11042 09800 

0.13776 06312 

- .(2)58190 22630 

13 

14 

-.(4)85882 98444 

0.11701 40143 

- .(2)45522 17628 

14 

is , 

-.(4)68114 09110 

0.10063 52146 

-.(2)36284 76722 

15 

16 

-.(4)54930 71863 

*(1)87476 39340 

- .(2)29389 67830 

16 

17 

-.(4)44943 31524 

.(1)76744 10943 

- .(2)24138 37900 

17 

18 

- .(4)37238 74691 

.(1)67875 46964 

- .(2)20068 35826 

18 

19 

-.(4)31200 03120 

.(1)60462 05713 

-.(2)16865 01032 

19 

20 

-.(4)26400 02640 

.(1)54201 64924 

-.(2)14309 22600 

2a 
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p 

i 

V _ 

4 


3 

0 26864 19754 

1 26419 75310 

- (1)98765 43210 

3 

4 

0 24409 17107 

1 19223 98589 

-.(1)56437 38977 

4 

5 

.(2)60764 25632 

.(1)30570 25279 

-.(2)10346 85479 

5 

6 

.(2)22147 39603 

.(1)11372 98915 

-.(3)29928 91882 

6 

7 

.(3)00719 21083 

.(2)51964 49641 

-.(3)11182 23340 

7 

8 

.(3)51454 71812 

.(2)27108 69377 

-.(4)49333 38267 

8 

9 

,(3)29218 90503 

.(2)15524 53840 

-.(4)24473 22709 

9 

10 

.(3)17816 50932 

.(3)95299 97834 

-.(4)13250 26436 

10 

11 

.(3)11479 15011 

.(3)61737 49551 

-.(5)76774 75857 

11 

12 

.(4)77282 94532 

.(3)41752 74961 

- (5)46962 67091 

12 

13 

.(4)53932 11190 

.(3)29248 26113 

- (5)30029 01554 

13 

14 ' 

.(4)38778 42203 

.(3)21098 48164 

- (5)19924 01348 

14 

IS 1 

.(4)28596 27483 

.(3)15602 20206 

-.(5)13639 45170 

15 

16 

.(4)21549 40811 

.(3)11786 12681 

-.(6)95910 77856 

16 

17 

.(4)16546 77743 

.(4)00604 43517 

-.(6)69030 58224 

17 

18 

.(4)12915 92277 

,(4)70028 U4898 

-.(6)50706 35472 

18 

19 

.(4)10229 00232 

.(4)56268 09892 

- (6)37922 29454 

19 

20 

.(5)82061 36861 

.(4)45209 40193 

- (6)28818 74227 

20 


P 


3 

.(2)79012 34568 

4 

.(2)28218 69488 

5 

.(4)37624 92651 

6 

.(5)85511 19662 

7 

.(5)26311 13742 

8 

.(6)98666 76533 

9 

.(6)42562 13407 

10 

.(6)20385 02210 

11 

.(6)10589 62187 

12 

.(7)58703 33864 

13 

.(7)34318 87490 

14 

.(7)20972 64577 

IS 

.(7)13306 78214 

16 

.(8)87191 61687 

17 

.(8)58749 43169 

18 

(8)40565 08378 

19 

.(8)28620 59966 

20 

.(8)20584 81591 
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CURVE APPROXIMATION 


TABLE II 

c . . ap)f 
x " x 2 Zp (p-x.)!(p+*)! 

(The number in parenthesis indicates the number of cipher* between 
the decimal point and the first significant figure.) 


X 

0mJ 




0 

0.31250 00000 

0.27343 75000 

0.24609* 375(H) 

0.22558 59375 

1 

023437 50000 

0.21875 00000 

0.20507 81250 

0.19335 93750 

2 

0.09375 00000 

0.10937 50000 

0.11718 75000 

0.12084 96094 

3 

0.01562 50000 

0.03125 00000 

0.04304 53125 

0.05371 09375 

4 

5 

6 


0.00390 62500 j 

.(2)97656 25000 
.(3)97656 25000 

.(1)16113 28125 
.(2)29296 87500 
.(3)24414 06250 


X 

p=7 

P‘3 

_ 2L*9 . . 

£rJO__ 

0 

0.20947 26563 

0.19638 06152 

0.18547 0S810 

0.17619 70520 

1 

i 0.18328 85742 

0.17456 05469 

0.16092 35229 

0.16017 91382 

2 

0.12219 23828 

0.12219 23828 

0.12139 89258 

0.12013 43536 

3 

.(1)61096 19141 

.(1)66650 39062 

.(1)70816 04003 

.(1)73928 83301 

4 

.(1)22216 79688 

.(1)27770 99609 

.(1)32684 32617 

.(1)36964 41650 

5 

.(2)55541 99219 

.(2)85449 21875 

.(1)11672 97363 

.(1)14785 76660 

6 

.(3)85449 21875 

.(2)18310 54687 

.(2)31127 92968 

.(2)46205 52063 

7 

.(4)61035 15625 

.(3)24414 06250 

.(3)58364 86816 

..(2)10871 88721 

8 

9 

10 


.(4)15258 78906 

.(4)68664 55078 
.(5)38146 97265 

.(3)18119 81201 
.(4)19073 48633 
.(6)95367 43)64 


X 

D‘H 

_ 

p*n 

p- /4 

0 

0.16818 80951 

0.16118 02578 

0.15498 10171 

0.14944 5980& 

1 

0.15417 24205 

0.14878 17764 

0.14301 09445 

0.13948 29154 

2 

0.11859 41696 

0.11689 99672 

0.11512 87556 

0.11332 %m 

3 

.(1)76239 10904 

.(1)77933 31146 

.(1)79151 01945 

.(1)70997 55442 

4 

.(1)40660 85815 

.(1)43837 48770 

.(1)46559 42321 

.(1)48887 39437 

5 

.(1)17789 12544 

.(1)20629 40598 

.(1)23279 71160 

.(1)25730 20756. 

6 

.(2)62785 14862 

.(2)80225 46768 

.(2)98019 83833 

.(1)11578 59340 

l 

.(2)17440 31906 

.(2)25334 35822 

.(2)34306 94342 

.(2)44108. 92725 

8 1 

.(3)36716 46118 

.(3)63335 89554 

.(3)98019 83833 

.(2)14034 65867 

9 

.(4)55074 69177 

.(3)12063 98010 

.(3)22277 23598 

.(3)36612 15305 

10 

.(5)52452 08740 

.(4)16450 88196 

.(4)38743 01910 

.(4)76275 31886 

11 

.(6)23841 85791 

.(5)14305 11475 

.(5)48428 77388 

.(4)12204 05102 

12 

13 

14 


.(7)59604 64477 

.(6)38743 01910 
.(7)14901 16119 

.(5)14081 59733 
.(6)10430 81284 
.(8)37252 90298 
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# 

/> v 

_ P j- /o __ 

r-/7 

0 

0 14446 44481 

013964 99341 

013583 375‘)(» 

1 

013543 5420! 

013171 75850 

012828 74300 

2 

0.11153 50518 

0.10970 46542 

0.10803 15281 

3 

.(D80S52 mm 

.0)80879 21888 

.(1)81023 (>4605 

4 

.(1)50875 (3708 

.0)52571 49227 

.(1)54015 70403 

5 

,(1)27081 60072 

.(1)30040 85273 

.(1)31019 40602 

6 

.(1)13324 57177 

.(1)15020 42636 

.(1)10053 08140 

7 

.(2)54509 61180 

.(2)05300 20158 

(2)76326 62309 

8 

.(2)18959 80497 

.(2)24480 82559 

.(2)30530 04024 

9 

.(3)5520*1 00017 

(3)78367 44180 

.(2)10508 30100 

10 

.(3)13271 90548 

.(3)21098 92(30 

.(3)31313 48040 

li 

.(4)25522 89510 

.(4)40880 50370 

.(4)78283 71590 

12 

.(5)37811 09053 

.(5)83725 89940 

.(4)10190 03089 

13 

.(6)40512 53200 

(5)11548 39992 

.(5)20994 38482 

14 

.(7)27939 67724 

.(0)11548 399<>2 

.(0)34831 46429 

15 

.(9)93132 25746 

.(8)74505 80507 

(7)32054 49777 

16 

| 

.(0)23283 00430 

(8)10790 00471 

17 



(10)58207 00091 



p*/q 

y 0*19 

p.~2Q 

T 

013206 0=5996 

012858 53206 

0 12537 06876 

i 

0.12511 00417 

012215 60546 

011940 00549 

2 

010634 35354 

0.10470 51897 

010311 87474 

3 

.(1)81023 64006 

(1)80908 55565 

.(1)80701 62839 

4 

.(1)55243 39504 

.(1)50284 21262 

.(1)57103 05345 

5 

.(1)33020 41437 

.(1)35177 63289 

.(1)36584 73821 

6 

.(1)18214 30779 

.(1)19699 47442 

.(1)21106 57973 

7 

.(2)87428 67737 

.(2)98497 37209 

.(1)10944 15245 

8 

.(2)30989 05581 

.(2)43776 60982 

.(2)50812 13040 

<) 

.(2)13099 05030 

.(2)17197 95385 

.(2)21025 71101 

10 

.(3)44034 59025 

.(3)59303 28910 

.(3)77094 27591 

11 

.(3)12147 47317 

.(3)17790 98075 

.(3)24809 12126 

12 

.(4)28344 10407 

.(4)45912 22380 

.(4)69944 40355 

13 

.(5)54859 55026 

.(4)10043 29897 

.(4)16956 21904 

14 

.(0)85718 0560(5 

.(5)18260 54358 

.(5)34909 86273 

15 

.(0)10390 00747 

.(6)26853 74056 

.(6)59845 47897 

16 i 

(8)91077 00595 

.(7)30689 98921 

.(7)83118 72080 

17 i 

.(9)52386 89483 

.(8)25574 99101 

.(8)89858 07654 

18 

19 

.(10)14551 91523 

.(9)13824 31946 
(11)36379 78808 

.(9)70940 58673 
.(10)36379 78807 
.(12)90949 47018 




APPROXIMATION AND GRADUATION ACCORD. 
ING TO THE PRINCIPLE OF LEAST SQUARES 
BY ORTHOGONAL POLYNOMIALS* 

By 

Cttarifs Jordan, Dr. Sc. 1 diversity of Budapest 

PREFACE. 

In the present paper the mathematical theory of approximation 
by orthogonal polynomials is given in its entirety and accompanied 
by nearly all of the necessary demonstrations. This has necessitated 
some mathematical preliminaries. 

Statisticians less interested in mathematics will find it sufficient 
to read § 12 on the recapitulation of the operations, the beginning 
of § 9 concerning the computation of the mean binomial moments 
and the mean orthogonal moments. § 11 dealing with the method 
of addition of differences, and the examples of §13. In para¬ 
graphs § 14 and § 15 dealing with correlation and in § 17 treating 
graduation, one may observe the end-formulae and ski]) the rest. 
With the aid of these sections and the tables, one may readily 
employ the methods and attain results with a very small amount 
of labor. 

TABLE OF CONTEXTS. 

§ 1. Introduction. 

§ 2. Some formulae of the Calculus of Finite Differences. 

§ 3. Changing the origin. 

§ 4. Changing the interval h , 

§ 5. The problem of approximation. 

§ 6. The deduction of the polynomial 

§ 7. Determination of the coefficient 

§ 8. Determination of the measure of obtained approximation. 

♦Paper read by Professor Harry C, Carver before the 93rd Annual 
Meeting of the American Statistical Association held in Washington, D, C., 
December 28th, 1931. 
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§ 9. Determination of the binomial moments. 

§ 10. Transformation of the orthogonal series in Newton’s 
expansion. 

§ 11. Method of the addition of differences. 

§ 12. Recapitulation of the operations. . 

§ 13. Examples. 

§ 14. First problem of correlation. 

§ IS. Second problem of correlation. 

§ 16. Some mathematical properties of orthogonal polynomials. 

§ 17. Graduation by orthogonal polynomials. 

Bibliographical and Historical Notes. 

§ 18. Chebisheff. 

§ 19. Gram. 

§ 20. Jordan. 

§21. Essher. 

§ 22. Lorentz. 

§ 1. Introduction . It has been shown that in any case of 
approximating a function Ffe) it is advantageous to develop that 
function in a series of orthogonal functions. It was Fourier who 
first used such an expansion in his treatment of trigonometric 
series. The first expansion in orthogonal polynomials was per¬ 
formed by Legendre . In Legendre's polynomials the variable * 
is a continuous one, it takes on every value between and +1 . 
Orthogonal polynomials with respect to a discontinuous variable, 
where x> assumes only the N values have been 

deduced by Chebisheff 1 who has treated the particular case of two 
orthogonal polynomials with respect to equidistant variables. 

Since then several authors have investigated this subject. Poin - 

iChebisheff. Sur les fractions continues, Journal de MathSmatiques pures 
and appliquees 1858, T. Ill (Oeuvres Tome I. 203). 

Sur Tinterpolation par la methode des moindres carres. Mem. Acad. Imp. 
de St. PStersbourg, 1859 (Oeuvres Tome I. p. 473). 

Sur ^interpolation des valeurs Squidistantes. 1875 (Oeuvres Tome II. p. 
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care 2 and Quiquet 3 considered orthogonal polynomials of a discon¬ 
tinuous variable in the case of non-equidistant values. Gram * 
employed these polynomials for • • •, r?, Jordan* 

considered the general case of equidistant orthogonal polynomials 
for equidistant values of x between a and b , the interval of two 
consecutive values oi x being h ; moreover he treated the partic¬ 
ular case of polynomials relative to x « £> 1,2, • • fa-1) which 
Chebisheff has examined in another way. Essher in his first pub¬ 
lication 6 used such polynomials with respect to 

and in his second 7 for x *- 77 , • v • Lorents R also intro¬ 

duced orthogonal polynomials for x=-r? r ~ t O, I,-*-,r? and for 
X/ = -2r?+l, •• •, *1,1, 3, * * ',277-1, the interval in this latter 
case being obviously equal to two. 

In later publications I showed new methods for using orthog¬ 
onal polynomials for approximation 0 and graduation 10 which per¬ 
mit the results to be reached very rapidly. In the present paper the 
general case of orthogonal polynomials for equidistant values of 
the variable is to be discussed; the formulae given are valid for 
all orthogonal polynomials of equidistant values such as the poly¬ 
nomials of Gram, Essher and Lorentz. These are also discussed 
in this paper. At the end some very useful tables are appended, 

- FotwccriV&Calcul cles probability's, Paris, 189b. p. 251. 

:i A . Quiquet , Sur une methode ^interpolation exposee par Henri Poin¬ 
care. Proc. ot the fifth International Congress of Mathematicians. Cam¬ 
bridge, 1913. p. 385. 

l J. Gram. Ueber partielle Ausgleichung mittelst Orthogonalfunktionen, 
Bull, de l’Association des Actuaires Suisses, 1915. 

r> C/r. Jordan. Sur une serie de Polynomcs dont chaque somme partielle 
represente la meilleure approximation d’un degre donne suivant la methode 
des moindres carres. Proc. of the London Math. Soc. 1921. 
n F. Essher. Ueber die Sterblichkeit in Schweden, Lund 19*20 
7 F. Essher. On some methods of Interpolation. Scandia, 1930. 

8 P. Lorents. Der Trend, Vierteljahreshefte zur Konjunkturforschung, Ber¬ 
lin 1928. Zweite Auflage, 1931. 

*K. Jordan, Berechnung der Trendlinie auf Grund der Theorie der klein- 
sten Quadrate, Mitt, der Ungarischen Landeskommission fur Wirtschafts- 
statistik und Konjuncturenforschung 1930. 

Ch . Jordan, Sur la determination de la tendance seculare des grandeurs 
statistiques par la methode des moindres carres. Journal de la Societe Hon- 
groise de Statistique, 1929. t , 

u 'Ch. Jordan, Statistique Mathematique, p. 291, Paris 1927, Gauthier 
Villars. 
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§2. Some formulae of the Calculus of Finite Differences. 
Since the functions considered in this paper correspond to 
a +/? t a+2h f a+3h-% the increment h being constant, the Calculus 
of Finite Differences will be found very useful. 

By the first difference of Ffe) we mean Ffe+h)-F(x) which 
is denoted by AFfcX so that A Ffe) = Ffa +h) - Ffe). 

The 7? -ib difference of Ffe) will be defined as 


We shall term Ffe) the indefinite sum of ffe) and denote it by 
2 ffx)HAFfe)- ffe)- It follows that 2 ffe)*Ffe)+C, 
where C is an arbitrary constant or a periodic function of 
periodicity h . 

The 77-t/7S\xm of Ffe) will be defined by 

Z n ffx)=z[z n ~Wxj[. 

This contains an arbitrary polynomial of the (r?~l)th degree, con¬ 
sidering only the polynomial and neglecting the arbitrary periodic 
function. 

It would be more precise to add the increment h to the above 
notation for the difference A and the indefinite sum F . Thus 
we could use A and F , respectively; but since in our formulae 

h . 

the increment will generally equal h we shall omit this index 
except in cases where doubt might otherwise arise. 

By the definite sum of ffe) between & and b , the following 
sum is understood ( b being equal to a+rtb and 77 an integer) 

f(*)+ fU+h)+ f(a+ 2 h)+ f(b-h) 


and is denoted by ^ 

z 

It can be shown that if Ffe) is the definite sum of -f-fx), the above 


Z f(x). 
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sum is equal to the difference of the values of F6c) taken at the 

limits, so that we have 
t> 

22 ffa) = ffa) +f(a+Ji)+-" + ffb-h) - Ffb)-F6a). 

x-a 

According to our definition it is evident that the value of the 
function f(x) at the upper limit, i.e. f(b t ), is not included in the 
sum between a and b . This terminology, although rather unusual, 
will be adhered to throughout this paper. 

We shall have occasion to employ the following formulae of 
the calculus of finite differences. 

Formula of differencing by farts, or of a product: 

(1) A - u(x)-A v(x.)+ i /{x+h)'A a fa). 

Formula of the n-th difference of a product: 

( 2 ) * I) o ( s ) A u.(j&+‘sh)'A S . 

Formula of the summation by parts, or of the sum of a product: 

ufa)-I2 vfa)~22 [a ufa) Zv{x,+hj \. 

Formula of the sum of a product, Ufa) being a polynomial of 
the n-th degree: 

22\u fa)- ^faj] ~ ufa)-Z2vfa)-Aufa)'22% r fa+h) 

(3) 

+A ufa)- 22 S \/fa+2h) -* 't'C-l^A ufaJ-22 ^vfa-tr?/?)* 

In the second member of this equation 22 vfa) contains one 
arbitrary constant, 22 v/^yvfeJcontains, besides this, one more, and 

so on. Ultimately the second member will contain r?+1 arbitrary 
constants — but since the first member contains but one constant 
it is evident that after simplification all of the terms of the arbi¬ 
trary polynomial must necessarily vanish except the single constant 

term arising from 22 n *vfa+r?h). 

Generalised binomial coefficients . We shall denote the general¬ 
ized binomial coefficient of the 7hth degree by 

/X ) __ xfa-hj-fa^h)- - • -(jz-nh+h) 

( 71 'h ” l * Z - 3 • 


77 
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where the index h is associated with the decrement, li h=l the 
index will be omitted, and the expression above will be equal to 

the ordinary binomial coefficient, 

Let us mention the following well-known formulae: 




Expansion of a function in a series of generalised, binomial 
coefficients: 

( 4 ) 


The generalized binomial coefficient can be expressed as an 
ordinary one by merely changing the variable. Thus if we place 
(x-&)/h = £ we have that 



and consequently if we write * f f&+ h %) it follows 


that A S FfO)=A S ffct) , and formula (4) may be written 
h 

Fft) = FfO)+(f)- AFf0)+(*)-4*F(Q)+ 
■■■ + {£)A n F(0). 


(4') 


Formulae (4) and (4') are two different forms of Newton's 
series. The great importance of Newton's formula to the statis¬ 
tician is not yet sufficiently recognized by the latter, since he 
nearly always develops his functions in power series in spite of 
the fact that he is generally primarily concerned with the differ¬ 
ences and the sums of his function. Now if a function be expanded 
in a Newton series as in (4) above, its m-ih difference and its 
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sum can be obtained immediately by means of the following 
formulae: 


( 5 ) 





ft 77-777 9 


( 6 ) 





A f&J 



Al&kl 


These operations would be very complicated if f(x) were 
expanded into a power series. Although it is true that a power 
series would be more advantageous for determining either the 
derivatives or the integral of ffa), we may remark that the 
statistician hardly ever needs these quantities. And for nearly 
all other operations, Newton’s formula is at least as convenient 
as an expansion in a power series. 

To illustrate the last remark — if f(x) corresponding to a 
given value of x is needed, then in the case of a power series 
it is necessary to compute the values of XjXfx^ • • * and these 
are obtained most readily by means of successive multiplication. 
In the case of a Newton series it is necessary to calculate 

(x-aXfc-aXx-a-h), (x-a)(x-a-h)(x-a~2h), .and these 

should also be obtained by successive multiplication. The formula 
for changing the origin and that for changing the interval of 
observation are given in the following paragraph and are as simple 
as those arising in the case of power series. If a statistician 
expands a function into a power series and needing the differences 
of the function for x* a. calculates them separately, he doubles 
his work since these differences are precisely the coefficients in 
Newton’s formula. In statistical research Newton’s series shook* 
always be preferred to the power series. 
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§ 3. Changing the origin. In mathematical statistics it fre¬ 
quently occurs that it is necessary to change the origin of a set 
of observations. For instance, if f(<s )is the value of some quantity 
corresponding to the first of jAnuary of the year 1901 
a * January 1, 1901, 

h represents the interval of a year, and it follows that x^A+^h 
represents January 1 } (1901+ %)where is an integer. If we 
know the Newton expansion of f(x) in generalized binomial 


coefficients 



that is 




AfCa) ./*-* ) 

h T7 S h JjT, ’ 


and desire the values of ffe) counted from the first of Ju/y 
of 1901 , then denoting 

c*Jvly 11901 

we need ffx) expanded into a series of generalized binomial co¬ 
efficients , that is 

' ' 'h h ( n/ h h 71 

The coefficients of this expansion must be so determined that 
4f»cy-^will correspond to July 1, {iQOl+^.Thtsz are easily 
obtained by putting x*c in the first equation of this paragraph, 


m-fra+c?) 


AIM 


. fc-a ) A n f(d) 


and also placing in the s-th difference of the same equation, 
so that we have 


(7) ^fCc).A S f(a)t( c -, a ) h ^ * r<t) + ■ ■ 



A^fCa) 
H * 7 ~ s 
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Substituting these values into equation (oc) yields the required 
expansion. 

Remark . If c - a * mh where m is an Integer, from the above 
equations it follows that 

f(c) = f(a+mh)z.n& A 5 f(c)’*A s f(a+mh\ 

An example of this is given in § 13. 

§ 4. Changing the interval h . Sometimes a changing of the 
interval is needed in Newton’s formula; this occurs for instance 
in statistics when a function giving the value of some 

quantity corresponding to the middle of the year , is known 
for several consecutive years fh « 1) by its Newton expansion 
(4), and it is necessary to obtain the values of the quantity corre¬ 
sponding to the first of each month. It would of course be possible 
to calculate these values by placing x - 1/12, 2/12, 3/12, * • • in 
Newton’s formula, but it is more advantageous to change the 
interval (h-l) in formula (4) and deduce another one in which 
both the increment of the differences and the decrement of the 
generalized binomial coefficients are k*l/l2. The formula thus 
obtained leads, by the method of summation of the differences 
(§ 11), more rapidly to the results. So, starting from formula 

< 8 > 

we have to deduce 

(9) 

To obtain this, it is sufficient to know that for ^ the differ¬ 
ences of the generalized binomial coefficients in a system 

of differences in which the increment is k may be written 
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( 10 ) 






To obtain these numbers, A w ,let us write the following identity, 
s V/\ jt-a-rnh + h 


A (*'*) ( x ' a ) 

k I m Jh A ( m -l' h 


777 


/ h * £ ‘"^'b 

and then deduce the s-th difference. By formula (2) we have 

A s /x-a i x-a-mh+h A s /x-a ) ^ sk A s ~1/x-a) 

f < m 'b~ » 

Placing in this equation x* a , we obtain 


( 11 ) 


a* . sk~(m-L)h * + sfLA s -l 

777 m M 777-1 777 M rr?-1 


The complete solution of this Equation of Partial Differences 
with the interval k would be very complicated, but one may 
readily solve it for some particular values of s and then deduce 
successively the other values of A ^ starting from the initial 
values which follow immediately from (IQ). These are that 

- O t except that j 

and 


Equation (11) can be solved first for * 9*777 yielding 
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and secondly for s -1 from which we obtain 

A* 


v - a' 

^rn ~ 


?77 


m-1 


./ k-fm-Z)h . 
A m r -™— A „ 

rn-f 777 777-Z 

so that by multiplying we easily obtain 




h ' 

and thirdly for s*m-/we may express successively the values of 


.777-/ „ 777-2 

A m 


■ A z 


. *77- v/-/ 


f//' V # jl-i . , f 

and then multiply each /I by k (m- 0)/m and adding the 

W m v 

products obtain the result 

7n-2 /A- ) 

(m-t)K (zj h • 


. 777-/ 


The other values of are obtained as indicated above. The 


rn 


following table contains the numbers necessary for binomial co¬ 
efficients up to the fifth degree. 

A'-f«) 

M z~^Z/ h 


A, = k 






A Z 3 = Zk(z) h 

£-i(7*->ih)-(i) h K-^(V h A}f 

A**$(3K-5h)-(*) n 

4' **’(*)„ 


Ar= (-5k-7h)(g). 


4-** 
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The n um bers A ^ being known, we can immediately express 

ra in a Newton series, with the increment equal to k, 
/ x ~ a ) Ajzl V x -J L ') daz 

( ( 1 4 ~ I * 4 k* ( 3 4 h 3 


+ • - ■ 


It follows from (8) that 

»*/ w . m+l % ^ 


and consequently the difference of /YW for will be 

Af(«)-Z -Pn-Kn* W- 

* /77=i* 77 777 /7 


When these values are placed in equation (9) the problem is 
solved. An example is given in § 13. 

§ 5. The problem of approximation. The number of the given 
values will always be denoted by N in this paper. The values 
X» y t \ yz>" v correspond to x** t d*h 4 a+Zb, b-h where 
b*&+ T!h. A parabola of the rt-th degree, y- is to be 
‘determined according to the principle of least squares, that is, so 
that the sum of the squares of the deviations \frrfe )-for 
% shall be a minimum. Hence the parameters 

in f n (x) must be so determined that the expression 


( 12 ) 


b 

s 2 



Z 


shall be a minimum. 
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To solve the problem in the ordinary way would require the 
solution of m+i determinants of the rt-ih order. This would 
be very laborious, as those who have employed Gausss method to 
solve this problem know. It is far more convenient to first expand 
the function ffe) into a series of orthogonal polynomials. 

Let be such a polynomial — it is termed orthogonal 

if it satisfies the following relation 

for all values of different from jll . The expansion of 
can be written as follows 


(14) f„M= & B U Q + a, U, + a z u z + 


* a n °- t 7 


where the a^ are constant parameters which must be evaluated 
according to the principle of least squares. 

To render expression (12) a minimum it is necessary that the 
first derivative of *5 with respect to a.^ should vanish for all 
values of \J . This will produce w/ equations which determine 
the W parameters, namely 

Z u\a. e U a + 4 l U l +a z U i +---- + a„U n -yl*0 

Z=d *- 

Asa consequence of relation (13) these equations are so simplified 
-hat we may write 

to „ to 

<s w 2Z Uj*- Z 17^- y - O. 
v z=a v •*=<* 


15 > 
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The second condition of a minimum, namely that the expression 


77 */ 71 + / 

z: z; 

U=0 m*0 


£S_ 


' d *M 


de^ da M 


shall be a positive definite form for all values of da^ and da >u> 
is also satisfied since in consequence of (13) this quantity is 
equal to 

U^fda^f > O. 


From (IS) it may be concluded that the coefficients are 
independent of the degree 77 of the parabola of approximation. 
Consequently, if the coefficients a c , a corresponding 
to a parabola of degree 77 have been calculated, then to obtain a 
further parabola of degree 77 +I it is sufficient to determine only 
one further coefficient, — the others will remain unchanged. 
This is of great importance. If the series (14) is limited at any 
term, the remaining expression will always satisfy condition (12). 

§6. Deduction of the polynomial Uj, Instead of starting from 
relation (13) we shall employ the following equivalent formula, 





where F^^j fe) is an arbitrary polynomial of degree 777 -/. If 

we were to expand F^^ fa) into a series of polynomials 

we would return back to condition (13) again. 

Applying formula (3), of the sum of a product, to the above 
expression yields 

uJfF^SU^M-AF^.Z UJ*+h) 
+A z F m _,-Z , V m U,ih) . 



CHARLES JORDAN 


271 


Now, TJJ m (x.) contains an arbitrary constant to which may be 

assigned such a value that 2ZU m (a)-0. But H U (x+h) contains 

z 

an additional constant which can be chosen so that£ U m (a.+h)=0. 

Continuing after this fashion we may dispose of all these arbitrary 
constants in such a way that the expression for the definite sum 
will vanish for the lower limit % = a, that is 



But in order that the definite sum may be equal to zero it is 
necessary for the above expression to vanish also for the upper 
limit, x-b . But since F(b )is arbitrary for all values of s it 
follows each expression obtained for 0 ,1,2, ust vanish 
separately for x~b- From this we conclude that (x-a) and 
(x-b) must both be multiplying factors of U(x ). Considering 
for the moment only the first of these factors we may therefore 
write 

U(x)=(x-a)co(x). 

Applying to this expression the formula for the sum of a 
product, (3), we have 

By successive summation we should find that (x-&)(x-<z-h)-‘' 
(%-d.-77?/7+h) is a multiplying factor of 22 ^^(x) and that we 
can assign the following form to this expression 


( 16 ) 


Z, m u m (x)=(*-*) h fM. 
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As (X.-&) must also be a multiplying factor of EU^fx^tht 
same reasoning leads to the expression of Z m U rn (.x) 


Z m U m fy)-C(*£) h (% b ), 


As this sum must be of degree 2m> it follows that C is an 
arbitrary constant and we conclude that the general formula of 
the orthogonal polynomials with respect to } b-h > is 

the following 


a ?) v„m. ca m rA]. 

Starting from this expression, there are two different ways of 

deducing the expansion of U m (x) in Newton-serizs, as has been 

shown in the paper 5 . First, we can utilize formula (2), giving 
the w-tf? difference of a product, and obtain 

T77+J 

( 18 ) Ujd.ch ”z o C?X*-% fsh ) h (£$)„ 

Secondly, we can develop Z n \7 Jf) into a Newton-stries of general¬ 
ized binomial coefficients • According to formula (5), we 
have then that 

os, >1 • 

x*b 



CHARLES JORDAN 


273 


The s-th difference in this formula can be written according to 
(2) in the following manner 



x-a + t Jh\ 

7T7-S+ \S'Jl 


/x-b \ T /s y 


X-a+n?h\ , K 
Zm-s /? n 


so it follows that 


! z\ m U / s )r b -< 3 + T77r7 ) 

-u; ^ £' (mX 'h ( s //y> 


ci i finally putting s=m+ \S into this expression, and determining 
the m-ih difference, we see that 

-'i, % t ) f ,( r ”£ / X b -%” h ) n - 

I :t us note that U 0 = C. 

As 2T * r i% r? fc )is symmetric with respect to <2 and b , we can 
get two other formulae for U m (x) from (18) and (21) changing 
a into b and inversely. For instance, remarking that b-a = Nh> 
and that 

/a-b + mh ) s-N+m ), m-\S / 4 \iri-\f/N-V-l \ /_ m-\S 
( m-V = r*V ( 


it lollows from (21) that 
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The constant C is arbitrary, whatever value may be chosen 
for it. The orthogonal polynomials introduced by different inves¬ 
tigators differ only in the value attributed to C. As these poly¬ 
nomials are closely related to Legendre's polynomials it seems 
most advisable to choose C in such a manner that for h*0 the 
limit of the polynomial U m shall be equal to Legendre's poly¬ 
nomial ^. For this purpose, we must put into (19) and (21) 
and, and, then, deducing the 

limit for h=Q , we obtain two known formulae for Legendre's 
polynomials. 11 

The choice of C is only important if we want to compute 
numerical values of U m fa) corresponding to any value of ^ ; in 
this case C should be chosen so that the calculation of 17^ fa) 
shall be as short as possible. As we shall see later Essher in his 
first paper and also Gram proceeded in this manner. The above¬ 
given value that I chose for C is in this respect also very accept¬ 
able; and whenever numerical values of are needed, we will 
adopt this value. It will be shown that our problem can be solved 
in a general way by leaving the constant O arbitrary. 

§7. Determination of the coefficients < 2 ^.These are given by 
formula (15) ; but first it is necessary to know the value of 


b 2 

El/J 

X*cl 


777 


To determine this quantity we shall apply formula (3) of the 
sum of a product to the following indefinite sum 

]= UnMZUJJ-AUj^U^h) 

( 22 ) ■ ■ 


proceeded in this way in the paper 5 loc. cit.; formulae (19) and 
(21) of the present paper are identical with (9) and (13) of the first paper, 
onh the notation is somewhat different. 
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jU \ 

Let Ui> now determine the quantities. 27 \ they 

are casilj obtained, if ju <m+l* by deducing the {m-ju)-ih differ¬ 
ence of Starting from formula (20), after having 

replaced x by k +yuh-h* it follows that 




n x+Mh-h-b \sS \fb-a-t-mh' 

C/? s=m s-m+ju JimA Zm-s / 


At the upper limit. . the first generalized binomial coeffi¬ 
cient figuring in this formula can he expressed by an ordinary one. 


fuh-h \ = / fl-1 \ h s-™+u 

''s-m+jU' h 's-m+j/U' 


but since -s = m, it follows that this expression is equal to zero; 
indeed an ordinary binomial coefficient ($■) is equal to zero, if r 
and t are integers, and if r < t. 

Zl^U^fc+yUh-k) being symmetrical with respect to a and 

b , itsf777y£z)i#difference fayufy-bJ'HiW be symmetrical too; 

but this quantity, as we have seen, is equal to zero for x=b , 
therefore it must also be equal to zero for . 

We conclude that at both limits x=a and x=b all the terms 
of (22) will vanish in which /nm+i, — that is all terms except 
the last. 

To evaluate this last term, we shall determine first the indefinite 
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sum ofZJX- From (20) we easily get, after putting x.+ mh 
instead of x, 


Z 


m+L 




<5 =777 


b-&+ 7 nh\ 

Zm-s ' h “ 


Since mKS+J , this expression will vanish at the upper limit 
z=b and its value at the lower limit, , will be 


m+1 - % 777-4-1 

Z UJ&+mh).%Z r 

m n 5=777 


mX 


<3~t>+r77h\ / b-a+ 777 / 7 } 

3+1 Sh ( ^7Z7-S Sb* 


Noting that b-a^Nh, we can express the two generalized bi¬ 
nomial coefficients of this formula by ordinary ones; indeed they 
are equal to 


/- N+rn \s rt+777 \ 1 
( s+1 )(2m-s) h 


* 177+1 * (-if +1 ( 


N-m+S 1 / /)/+ 7T7 , Xml 
•S+1 AZn?-*) 1 ’ 




so that we have 



According to a known formula of Combinatory Analysis 

(Netto, p. 250,) the last sum of this equation is equal to (- 1 ) /77¥ ’l m 
So it follows that 
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Moreover from (21) we deduce that 

a m u„M. eh *%). 

Now we have determined the values, at the limits, of all the 
quantities figuring in equation (22): hence the definite sum will lie 

(24) i U* . C *”(*S)(&7l) 

It is useful tn remark that this expression is independent of 

z z 

the origin of the variable x .. Let us note also that ZU^-C /V. 

To determine the coefficient from equation (IS) it is still 
necessary to determine Z)f'U m (x). For this purpose, let us start 
from equation (21 '). that is from the AYa'/oa-series of U m , with 
respect to the generalized binomial coefficient We have 



b 

In the analysis of continuous variables the quantity fx s ydx 

d 

is known as the moment of y of the &-th degree (we will say 
the $-ih power-moment). We have seen that in analysing equi¬ 
distant discontinuous variables it is not advantageous to operate 
with powers, but that it is far better to express the quantities in 
binomial coefficients. Indeed if an expression were given in power* 
series, we could not consider it to be a full solution, as it would 
still be advantageous to transform it into a binomial series. There¬ 
fore we have no use for nower-moments and shall introduce 
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binomial moments. The generalized binomial moment of degree 
\J will be denoted by 3 ^ and defined by the following equation 


(26) 


b 

z 

X-d 


(V) h v 


1 /* 


This can be easily expressed by ordinary binomial coefficients, 
for if we introduce a new variable %=(x-<x)/h , we have 


(27) 


B v .h' , Z (')y.. 


As will be shown later in (§9) there is a far better method for 
rapidly computing the binomial moments than there is in the case 
of power-moments. 

Several statisticians have remarked that it is not advisable to 
introduce moments of high order into calculations. In fact, if /V 
is large, these numbers will increase rapidly with the order of the 
moment, will become very large, and their coefficients in the 
formulae will necessarily become very small. It is very difficult 
to operate with such numbers, the causes of errors being many. 

To obviate this inconvenience, I have introduced the mean 
binomial moment; that of the \)-ih degree will be denoted by 7\y 
and defined by 



The sum figuring in the denominator is according to § 2 equal to 



MS*)* 


•J+i'h ~ f V+l 





CHARLES JORDAN 


279 


so that T]/ will be 



If we introduce the variable if mentioned above, we obtain 


^ -fJpy/fZi)- 


Consequently the mean binomial moments are independent of 
the origin of the variable x and of the interval h . The binomial 
moment increases rapidly with /V and also with \), though less 
rapidly than the ordinary power-moments. However, the mean 
binomial moment remains of the order of magnitude of y , what¬ 
ever /V or \) may be. For instance if y=A, it follows that 
for any value of \J . 

Substituting in (25) the value of obtained in (28), we have 

x*a U=O 

To this expression we can give the following form, 

771+1 

(srch^MfZ,) z (-» "Z'X'S) Zl . 

U*C v+l 


To simplify we shall write 


(29) 


x? / j > 777+\S/m+b / )/’ 77 ? )-J— 
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approximation by orthogonal polynomials 

These numbers are integers, and as they are very useful are 
presented in the following table, which presents all the numbers 
necessary for parabolas up to the tenth degree. 


‘J. ’ ct i?l G foi* ft ttj 



1 -10 30 -35 14 

-1 15 -70 140 -126 42 

1 -21 140 -420 630 "462 

-1 28 -252 1050 -2310 2772 

1 -36 420 -2310 6930 -12012 

-1 45 -660 4620 -18018 42042 

1 -55 990 -8580 42042 -126126 


132 

-1716 426 

12012 -6435 1430 

-60060 51480 -24310 4862 

240240 -291720 218780 - 92378 1679 


The following relation can be used for checking these number* : 


@mo * @ml * ~ ° * 


Moreover, let us put 

m+1 

(30) 


If we already know the mean binomial moments, the values of 
O^may readily be computed with the aid of the table above. 
Finally we obtain 

( 31 ) U„My. O m . 


As this expression could be termed the orthogonal moment of 
degree 777 of y , we could consider as a certain mean ortho- 

12 @ 7 r? °* this paper correspond respectively to and 

(-jf) ?7 ?9of the paper loc. cit. ®. 777 
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gonal moment of y of degree m. These quantities, as we shall 
see, are very important; they are independent of the origin, of 
the' interval, and of the constant C . Thus, they are valid for all 
orthogonal polynomials. 

In particular 9 o ~T 0 =BjH\s the arithmetic mean of the 
given quantities y . 

Finally, equations (IS), (24) and (31) yield, after simplifica¬ 
tion, the formula fora^ . that is, 


(32) 


777 C 


The coefficient a m is independent of the origin of x . In par¬ 
ticular if rn=-0, we have /C. 

Now the equation of the approximating parabola is known, in 
the form of its expansion into a series of V ^ polynomials (13) f 
and our problem is solved. But if it is necessary to compute a 
table of the values of the parabola f-n(z) corresponding to 

a+h, a+2h } ., the corresponding values of (z) must 

first be calculated by formula (21'). Although this seems easy 
enough, yet if /V is large, the computation is a tedious one even 
with the aid of Table IV which presents the values of the binomial 
coefficients, (jfj for integral values of . If it should be neces¬ 
sary to compute {^) for non-integer values of , the calculation 
must be made in the ordinary way, that is, by multiplication. At 
all events the calculation would not be shorter if the t^^were 
expanded into power-series. The labor will be decreased, how¬ 
ever, if tables giving the values of ^^corresponding to z**, 
ei+h, a+JZhj - • • •, are available. I adopted this procedure in my 
paper published in 1921 and later Essher, and also Lorents, did 
the same 135 . 


li{ See toe. cit. fl , y , 10 , 7 and s . 
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It will be shown, however, that these tables are superfluous, as 
by a transformation of formula (14) into a Newton-series, we can 
get the required values still more rapidly by the method of addition 
of differences; and if an interpolation is necessary for any values 
whatsoever of x , Newton's formula will give it in the shortest 
way. 

Moreover, by this method we shall be independent of the value 
of the constant C , that is of the orthogonal polynomial chosen. 

We will give in § 10 formulae leading directly to Newton's ex¬ 
pansion, so that the computer will have nothing to do with the 
orthogonal polynomials themselves. He will only have to compute 
the binomial moments 3 , and then deduce the mean orthogonal 
moments O m , which will give, with appropriate coefficients, the 
solution in Newton s series. 

§ 8. Determination of the measure of obtained approximation . 
The approximation is generally measured by the mean-square - 
deviation O *, that is by the mean of the squares of deviations 
between the parabola and the given values y . It is expressed by 


* i p 

~ yy & 
/Y Z=d 



If in this formula we put in place of f n (x) its expansion (14) 
in orthogonal polynomials, we shall obtain in consequence of the 
condition, (13) of orthogonality, 


' f 7i+l p t> 2 

* +k Z n\ a rnZ<- 

i /V 777*0 L. z*a 


, ~ 2 * 

£ Ul £ U 

7 z*a 777 m 


? £ U m y 


']■ 


and on account of (13) 


( 32 ') 


„ 4 & „ 4 77+1 r & 

y *mZ U m - 
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We can still simplify this result, since from formulae (24) and 
(32) it follows that 


(33) 




(2 ™+l){ Jv) 

("W) 


And if to abbreviate we write 


(34) - fcm+lX ”m)/( "*m) 


and note that C a -1 , then the mean-square-deviation will be 


(35) a Z m k? y- e ?- c * 0 **- c *ef-" 


X=ct 


--^<v 


The coefficients C^ can be easily calculated by (34), using 
Table IV for the binomial coefficients. But we shall see that C m 
is equal to the absolute value of a certain quantity, which we have 
denoted by C rr?<? and which is given in Table III for values of tf 
up to 100. 

As the mean orthogonal moments are already known, therefore 
to obtain it is sufficient to compute ZJ y 2 . 

Remark . All quantities figuring in (35) are independent of 
the origin, of the interval and of the constant C ; consequently 
this formula is valid for all systems of orthogonal polynomials. 

Sometimes it is necessary to know CJ^ S , the mean-square of 
deviations between two parabolas, one of degree n , the other of 
degree s , (where r? >$), both approximating to the same values 
of y ; that is 

a ns f s *• 
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If in this formula we place the values of (%) and f s (x.) ex¬ 
pressed in series (13) of orthogonal polynomials, we have 

f n (x,)-f s M= a s+i U s +i + <a s + 2 U s +z + / s-rPn 


and fn consequence of (14) 

- j n*I b 2 

a =-±- r ** t u 

sn h X, Z, u m 

Y 777*S+1 


then, using formulae (33) and (34) we find that 


~ Cs+1 


--+c n e; 


Having obtained the equation of an approximating parabola of 
degree 71 , it may develop that the resultant approximation is not 
close enough, the mean-square-deviation being too large. We may 
then pass on to a parabola of degree ~n+i by determining only 
the one additional coefficient a > —the others do not change. 
For this purpose we must compute and then \ the 

coefficient will be given by (32). The new mean-square-deviation 
will be 


Remark . If the binomial moments .were 

given, and we proceeded to determine a polynomial of degree n 
such that its first n+1 binomial moments should equal respec¬ 
tively to the values given above, thus employing the principle of 
moments, we should reach the same result as though we had 
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imposed the principle of least squares . In the case of polynomials 
both principles lead to the same function. 

§ 9. Determination of the binomial moments . Chetverikoff has 
given a very good method for their determination, which dispenses 
with all multiplication. We have seen that 



Chetverikoff’s method produces the last sum, that is the ordi¬ 
nary binomial moment (7?=1) ; to obtain the generalized binomial 
moment .Sethis must still be multiplied by h ^; but it is needless 
to carry out this multiplication, since we need only the mean- 
binomial-moment Tsj t which is given in both cases by 


The method consists in the following: Let us denote by y(%) 
the value of y corresponding to 5f ; in the first column of a 
table the values of y (^are written in the reverse order of mag¬ 
nitude of , that is 

yri-l), yCN-2), . yCl), y(0). 


Into the first line of all the columns we write the same number, 
y(N~l). Into the \)-th line of column yt we put the sum of the 
two numbers figuring in line \J-1 of column ju , and in line \) of 
column /u - 1 « 

In column/z we stop at the line the number figuring 

there will be ; to obtain the mean binomial-moment of 
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degree ju-2 this must be divided by (//-// If we want the mean- 
binomial-moments Tj, — T n > we must compute 77 +I columns. 
An example is given in § 13. 

An elementary demonstration of this method is given in the 
papers loc. cit. 9 and 10 . We will give here a more direct one. 
Let us denote bythe number written into the \)-th line 
of column/z. The rule of computation will be 


4>(\4/jL -IJ+ It• 


This is an equation of partial differences of the first order which 
may be written as follows: 




Let us solve it utilizing Laplace's method of generating functions. 

We will call u=uf? Jy u)tht generating function of ith 

respect to v , if in the expansion of a in powers of t the coeffi¬ 
cient of t ' 1 'is equal to pfv,//), where jll is a parameter of u • 
Since O , then we have, 

u = + 

&V,ju)t VAH M.)t • • • • 

From this we easily deduce the generating function of 
If we divide both members by t , the coefficient of i ^ in the 
second member will be <p(\J+l >A ). Hence ufc M )/t is the gen¬ 
erating function sought. Since// is a parameter, it follows from 
the preceding that the generating function of ffaju+i) will be 
ufr/i+l), and that of be uft u+l)/t . If in 
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equation (ot ) above we substitute the corresponding generating 
functions for the function^ , we obtain 

(/3) (l-t)u aft, ju)~ O. 

In this way we have reduced the partial difference-equation () 
of the function <2? to an ordinary difference-equation of its gen¬ 
erating function a . The solution expanded into powers of t , 
will give the function <p itself, which is sought. 

Equation (j 3 ) is a homogeneous linear equation with constant 
coefficients, t being only a parameter from this point of view, and 
can be solved immediately, yielding 

( f) 4 / = Cdftj/f l- tJ A 

where u>$) is an arbitrary function of t ; and may be determined 
by the initial values, that is, by the values put into the first column. 
Placing = i into equation ( f) we obtain the generating func¬ 
tion of these values. Hence, 

/y• * * / yf'Oj t 

Finally we have 

a « \yO • • * + y(7V- • • - 

S-O ^ 


Since 
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we may obtain the coefficient of i V in the expansion of u . 
By letting s^^zwe have therefore that 

N+i p, & 

<PMaO-z ( m -2 


If we place in this expression \/szN-ja+2 and = , we see 

that 


(u.zJyfe)- 

We conclude, therefore, that the number figuring in line N-jU+2 
of column ju will be the ordinary binomial moment of degree 
/i-Z . It was this that was to be demonstrated. 

§ 10. Transformation of the orthogonal series into Newton's 
expansion. Since the approximating parabola and the mean square 
deviation are independent of the constant of the orthogonal poly¬ 
nomial used, it is natural to transform equation (13) so that it 
shall also be independent of this constant. This can be done by a 
transformation into Newton's series. 

We have seen that the coefficients of Newton's series (4) are 




To obtain this series, therefore, it is sufficient to determine these 
quantities, starting from the orthogonal expansion. 

Deriving from formula (13) the s-th difference of f n (x ), we 
have 


77 >*$ 


7T? 


**v„r*). 


m 
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To obtain we start from formula (21'). Since (\J. < s)j 1 

is a multiplying factor of the s-ih difference of U m , we conclude 
that for z*-& , all terms obtained from (21') will vanish except 
the term in which \/= s . Hence we have for s= 0,1,2, • • • ,rrj 

A s u m ra)‘Ch 2m f m ^ s ). 

Placing this value, and that of taken from (32), into equation 

(36) we get 

A&JZ s ) /> J ) rn-Ss* j)rrr?+s \ C^JL1&J22 

A T^CciJ^L (- 1 J {£777+!A m J > n? ) ’ 

7T?z:$ ( ^ y 

where is the mean orthogonal moment of § 7. To abbreviate 
we shall write 

( 37 ) C 77JS * £-1) 777 S (2777+1)f ^tt? 5 ) s N ~+rr? S~ * 

1 277 1 


Finally we find that 


(38) A%ra),c ss e s .c s „, s o SfJ 4 .... * C„ s n 


For instance, we have for s=0 , 

frjfa)- C ao O a -*C io @1 +Cg C -f- •■• + C f7Q 

where C ao =l. Let us remark again, that the second member of 
equation (38) is independent of the origin, of the interval, and of 
the constant C of the orthogonal polynomial chosen. 
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The importance of the numbers C 7r)s was first recognized by 
W. Kviatovszky, who calculated a table for these numbers. This 
table has not been published. The author’s Table III is more 
extensive, giving these numbers with more decimals for /Y up to 
100, and for parabolas up to the seventh degree. 

Having obtained, by the above method, the Newton expansion 
of the approximating parabola, it may happen that the expansion 
corresponding to a parabola of degree T)+l is desired, and this 
requires the calculation of 0^+1 . The coefficients of the new 
binomial expansion of f .^ Cx) are easily deduced from those of 
fyjfc) previously obtained, since 

fn+J fa )* ^ fry f&)+ Cr%+I> $ * 


The work previously done is therefore not lost. 

§ 11. Method of the addition of differences. Knowing the co¬ 
efficients of Newton's formula f^d), , we can 

proceed to calculate a table of the values of frifc) corresponding 
to a.+h } a+Zhy, £>-f?by adding the differences. This method 

has been used by H. Henning in his remarkable paper on the 
Trend-lines. 14 It proceeds as follows. The functionary) being 
of degree 7?, it is evident that A is a constant. 

Into the first line of the first column of a table we shall write the 
number A* 7 '*f Into the other lines of the first column we 
put the number of the preceding line of the same column, increased 
by A ^fnCa). We stop in this column at line /V- n+1 . Accord¬ 
ing to Newton's formula, we have 

A 77 '* frjfz)* A^frffy) 


li H. Henning, Die Analyse von Wirschaftskurven, Vierteljahreshefte zur 
Konjunkturforschung. Berlin 1927. 
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or 

A ^ f„ fx). A r*)+ fA C* ) 

where x-&)/h\ hence the first column will contain the values 
of the^-^-M differences of (z). It is advisable, before con¬ 
tinuing, to check the last number in this column by the formula 
above, putting therein /V- 72. 

Into the first line of the /. L-ih column we write the number 
> and into the \)-th line of the same column the sum 
of the numbers figuring in line \)-l of column JJ.-1 and column 
ju . The computation will be stopped in this column at the line 
N-v+ju. The ju-ih column will contain the values of thefa-^Jh 
differences of , which follows from Nexvtoris formula 

4 f„ Cx+h)- 4 M 


or 

(39) 

Before going on, the last number of the column/^ should be 
checked by the preceding formula, putting into it X * N-'h+ju-l. 

We continue in the same manner, — the last column to be 
computed is the r?-thaxi& will Contain the values of 
..., f n (b-h). This last number can be checked by formula (39), 
by making the substitution pL^r? and if*/V-/. An example is given 
in § 13. 

§ 12. Recapitulation of the operations. To solve the problem 
of approximation of § 5, it is necessary first to compute the mean- 
binomial-moments 71, 7], 7^. This is done by drawing up 
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Chetverikofs table (§9) and dividing the number in the line 
ri-jU&oi column// . We obtain in this way 7^: this 

must be repeated in every column. 

Then the numbers are taken from Table I., and the mean- 
orthogonal-moments G q , G s> are calculated by 


(30) 6 C *T 0) Om T c + 7/ + • • • * ft mr77 7 ' 77? . 

Then2Jy0Y is computed. The numbers C ?/? -\C 77? ^\ are taken 
from Table III, or if this table fails, calculated by formula (34) 
and by Table IV which gives the -binomial coefficients. The mean- 
square-deviation is calculated by formula 

(35) -ef-c,e,*+ ••••<, <3/ . 

If this quantity is conveniently small, the approximation is con¬ 
sidered close enough; if not, we proceed to calculate , and 

Cl-n+l* and so on until a sufficiently small mean-square-deviation 
is reached. 

Now we proceed to deduce the Newton’s expansion of the 
required function f„ f z), The numbers C ms are taken from Table 
III, or if this table fails, calculated by formula (37) with the aid 
of Table IV. 

The constants of Newton’s formula are given by (38) 

+ G, + C xo O x + — ■) & v 

A f-r, Ca)= C f1 Of+ C g1 Q z + • • ■ ■ + C^i Gy, 
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A Czz @2 + C 32 G 3 + ■■■ - + G n 


A f-rjCa)* C ss 6 S + C s+1 s & s +j + ••■• + C„ s 


Q„ 

Now the equation of the parabola is known in the form of its 
Newton -series 

fi?(x)= f n C*)* ff)A f„ Aa)+ff)A 
(4) 

where 4f= (x-a)/A This will be considered as the desired solu¬ 
tion. As has been said, it is quite useless to expand the parabola 
in powers of x . 

If it is necessary to deduce a parabola of degree r?+t starting 
from equation (4) ; the new coefficients will be 


f r,fa) + *-77+1,0 ^77-+l 

^ ^77 & 71 + 1 - 


Generally a table of the values corresponding to the parabola 
and to x=a,<s+/ 7 y is needed; this will be computed by the 
method of addition of differences (§11). The last column will 
contain the required values. 

§ 13. Example 1 . Let us choose an example given by 
Lorentz 15 in which six values of y are given, and where /V*d» 


15 loc. cit. 8 , p. 21. 
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h*2 and a*-5 . The approximating parabolas of degrees 1, 2, 
3, 4. and S are required. 

The values of y corresponding to 1, J are 

written in the first column of the table below in reversed order of 
magnitude of x • The other numbers of the table are obtained by 
Chetverik off’s method (§9). 


12293 

12293 

12293 

12293 

12293 

12293 

10875 

23168 

35461 

47754 

60047 

72340 

10058 

33226 

68687 

116441 

176488 


10018 

43244 

111931 

228372 



8530 

51774 

163705 




7880 

59654 





The required 

mean-binomial-moments will be 



Z -39634/6 = 9942, 3333 * T 3 =176488/13=11763,8667 
Z -16370/13*109136667 77340/6=17036,6667 

T 2 = 778377/70 = //*/<? <5 £ = 17793 

The mean-orthogonal-momcnts are 
G 0 'T 0 = 9942,7311 
G±*T, - T a = 971,3333 
G Z -2T Z -3T,+T 0 = 36,3333 
e 3 = 5T 3 -!OT z +6T,-T 0 = 183 
G a ~14T 4 -33T 3 +30T s -10/ + / = 331, 666 7 
G s = 47/-126/+140/ - 70T 2 +13/ -/ « -1132 

The squares of the mean-orthogonal-moments are 

0 /= 96849963,46 G/ = 33489 

9? = 943488,36 123669, 43 

&** 1484,62 OJ= 1327104 

Zy*/6 = 100960410,3 

♦Editor’s note. In this country we usually write QC02 5J55instead 

Oi 9942,3333 For the purposes of this paper I prefertouse the con- 
tinental notation appearing in Professor Jordan’s manuscript. I agree that 
the following typical product of three factors ff- • 34^ 77?9J * f ) 

appearing on page 300 is less liable to be confused than if written 

j; ■ 341.775-95 



CHARLES JORDAN 


295 


From Table III we get: 

C 10 - - 14285714 = 0,21428571 

4o= 1,76871429 C so =-O,O2380952 

C 30 - -0,83333333 


Now the mean-square-deviations for the parabolas of degree 
0, 1,2, 3, 4,5 will be: 

<g =Zy*/6-e* = 2110424, 62 
a, z =a y f-C.G,*. 88664,05 

4 - o; 2 - C 2 9* = 86012, 56 
a? * of - C 3 6/ = 56105, 08 
4*crf-C 4 04= 31604,49 
* ai- C s 9* - 6,80 

As the parabola of degree 5 passes through the given six 
points, therefore <5* should be exactly equal to zero. 

Let us now proceed to the determination of Newton’s formula 
corresponding to the five parabolas required. It is to be observed 
that the amount of work required to solve this problem, is inde¬ 
pendent of the number of observations. First the following num¬ 
bers must be taken from Table III: 


C„ = d ?65714286 C Z2 - 1,07142857 
C Zi = -2, 14285714 C 3Z ---2,5 

C 31 *Z 0^1,92857143 


c<r - 3 

C S3 = 1,3531353 


C Z1 = -0,85714286 C sz =-0,5 C s4 . = -5 

C S1 = 0,14285714 C 33 - 1,6666666 7 C# = 6 



296 APPROXIMA TION BY ORTHOGONAL POLYNOMIALS 
Formulae (38) of the preceding paragraph give: 

f/a)=O e +C 10 e, = 7860, 90480 

f z M* fA)+C 20 G z = 7929, 71427 

f 3 (k)- 4 (a)+C 3c Q a - 7777,21427 

f/*)- f 3 a)+C 40 e 4 - 7812,17142 

f s (*)-V*)+c so e s - 7779, 99998 

Af/a)^C 11 & 1 = 832,67140 

Af/al-Af/aJ^Q * 7 90,00001 

A f 3 (*) - Ai a (a)+C 3t Q 3 = 1116,00007 
Ai 4 (*)=Af/a)+C 41 & 4 = 814,17114 

Af/a)-Af/a)+C 31 9 s = 610,00002 

off/a)=c 43 e 2 - 41,28760 

A*f 3 (*)*A z f g a)+C 3t Q 3 --416,21432 
A 2 f/a), A% M+C 42 G 4 = 262,00003 
A*i/a)*A%(a)+C 52 Q s -838,00003 

A 3 f/a) - C„ 6> 3 = 307, 

A 3 f 4 6a)* A%7a)+C 43 0 4 —707,00010 
A 3 f s (a)*A 3 f 4 (a)+C S3 G s — 2286,00010 

a\ 6)* c 44 e 4 -1055, oooio 

A% Ca)*A A f 4 U)+C s4 e s - 4511,00010 


A%fa)-C SS G S 


-6912 
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Before writing the formulae of the parabolas, let us introduce 
fa-#)//?*Zand put f('a+%h)=F{& we have then 

F/F)= 7860, 905+ 832,571(F) 

F/?) = 7929, 714 + 750(f)+4J, 886 (f) 

GW- 7777,214+1116(8)- 4J6,2i4ff)+305(() 
G<V- 7852,571 +m,57l(f)+262/(()*705(()+1055(2) 
F/O- 7780+650(()+&58(2)-22867{)+451l(2) 

- 6912 ( 2 ) 

These results are exact to three decimal places. 

The method of the addition of differences can be applied imme¬ 
diately to these equation. 

Example 2 , The values corresponding to the approximating 
parabola of the fifth degree and corresponding to the given values 
of a: in the preceding example are to be determined. 

Starting from the last equation above, we will determine 
for Noting that A S FfO) = -&912 the table 

below is obtained by using the method described in §11. The last 
column contains the required values of which, as in this 

- case the parabola passes through the given points, should be 


exactly equal to the given values y 

in Example 1. 


4511 

-2286 

858 

650 

7880 

-2401 

2225 

-1448 

1488 

8530 


-176 

777 

40 

10018 



601 

817 

10050 




1418 

10875 





1 2293 


The results are exact to three decimal places. 

Remark on the number of decimals to which the calculations are 
to be carried out. If for instance a parabola of the fifth degree 
is to be determined approximating the values of y , which are 
given with a precision of half a unit, then should be cal- 
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ciliated to seven decimals if the number of the given values is 
near SO, or to eight decimals if it is near one hundred; A f fa) 
should be calculated to six or seven respectively; A 3 f (a) to five 
or six, and so on. The corresponding orthogonal moments and the 
numbers C m$ must be of course calculated to the same number 
of decimals. 

Example 3. Changing the origin. The given values y in 
Example 1 correspond to 

July 11901 +% 

for where £ is expressed in years. These values 

have been approximated by F 5 C^) obtained in the last equation 
of Example (1), to abbreviate we shall write it F(£) . In this 
case, 4 fv/y 1J901 corresponds to a = 0. It is required to develop 
the function F{%) in a series of binomial coefficients (), 
where c=-^ corresponds to JA/vuxjry 1J901 . Equation (7) 
will give for h=l and with an exactitude of three 

decimals 

AF(~P = -6912 

4511 69/2 = 7967 

A 3 F7-£)- -2286 -i •4511 ~§ • 6912=- 7133,5 
2?F{~j)= 816 ■2266 +§ ■ 4511 +£-6914=5834,625 

AF(-j>) = 650-1 -838-§ ■4486-f 6 ■ 4511 

~ j§Q - 6 912 = - 3925, 938 
F(~i) = 7780-i -650+ §-838 +£ -2286 
*, J§ ■ 4511 +M1-6914 = 11418,102 
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Finally the required formula will be 


Ff?)-11418,102-392}, 938( f /)j.;832, 623( f /) 
- ri33,3( r ;ij* 7967^/)-6912^/j . 


This equation was checked by calculating FfO) which was 
necessarily equal to 7780. 

Example 4. Changing the interval. Let us suppose the follow¬ 
ing function given 


ffa)- 7780 




yX-f-5 
( i 




838/ x+5) 2286/z+5) 
2* ( 2 'z 2 3 1 3 'z 


, 4511 / x+5) 6912/x+5) 

~2* ‘ 4 'z ( 5 4 * 


This is a N <?wfo«-expansion in which the decrement of the gen¬ 
eralized binomial coefficient and the increment of the differences 
are both h=Z. It is required to deduce another iVmfott-expansion 
such that the mentioned decrement and increment should both be 

k-v 3 . 

For this purpose it is necessary to calculate 7ff(x)iat x*-5 
and 1,2,3, 4, 5 • First we must determine the numbers 
A^j/h m introduced in §4. 

For h=2, and k= 1/3 we have 

Aj /h * 1/6 A^/h - 5/72 

A* /h55/1296 A* A^- 935/31104 

A l s /h s = 21505/933120 
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A z Jh* * 1/36 A*/h 3 = - 5/216 

A*Jh*= 295/15552 A*/h s - -55/3456 

A 3 Jh 9 -1/216 A*/h+- -5/864 

A 3 s /h s ~ 185/31104 A//h 4 ~ l/l296 

At /h s = -5/3868 A// h s = 1/7776 

Now we can proceed to calculate the differences A s f(a ) given 
by the formula in § 4. We have 


Af('a)^f-670-^-838-^ 6 '2286 -jjj^ • 4511 

~9F3&d ’ 6912 “ ' 341 > 77393 
2ff(*)=± - ^<5 ■ 2236 + ■ 4511 

+3TJd ‘ 6 ? J2 ’ 271, 26190 

77 79977 

A 4f7a) = lh~6 ■ 4311 she ■ 6912 * *2. 36960 

A s f(a) = -= - O t 86869 
Hence 

f(x)- 7780-^-341, 77395/*i S )+^-271j6l90( x + s ) 
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-p777,79977/V)+p (It,3696X*4 5 ) 

-fp-(0 J 88889X*s S ) 

where k^l/3. If we want to apply the method of the addition 
of differences it is preferable to change the variable by introducing 
^(x-a)/k and writing FCK)- f (a+£k). We have 


FO?) *7780-341, 77393(f)+871, 7619(f) 

- 77, 79977(f)+13,3696((p-0,68889(8). 


Of course it would have been better to change the variable before 
beginning to calculate the numbers , but we wanted to show 
the method in its generality. It is advisable to check the above 
equation by putting into it 6 : the result must be Ff&)- f(l)= 
3430. The checking has given this number with a precision of 5 
decimal places. 

Starting from A?F(f)=-0,88889 , let us determine the first 
values oiFff)ior f =0,1,4? 3, --6, by the method of the addition 
of differences 


12,36960 - 77,79977 

11,48071 " 65,43017 

10,59182 - 53,94946 

- 43,35764 


271,76190 

193,96213 

128,53196 

74,58250 

31,22486 


- 341,77395 

- 70,01205 
123,95008 
252,48204 
327,06454 
358,28940 


7780 

7438,22605 

7368,21400 

7492,16408 

7744,64612 

8071,71066 

8430,00006 


§ 14. First problem of correlation. It is required to determine 
the coefficient of correlation r nm between the deviations f n fa)-y 
of a parabola of degree 7? approximating to the values y (§3) 



302 APPROXIMATION BY ORTHOGONAL POLYNOMIALS 


and of the deviations of a parabola of degree 777 ap¬ 

proximating to other values y‘ corresponding to the same x 
quantities. Let us suppose that n tm. 

The definition of this coefficient is the following: 


(40) T'yjrrj 


J_ _ 


b 

£ 


x=a. 




where is the mean-square-deviation of f n fa) and y (§ 8) ; 
and the mean-square-deviation of f n fa) and y / . 

Let us put into (40) the values of fa) and expanded 

into series of orthogonal polynomials (14); then the sum in the 
second member of (40) will be 


( 41 ) Zyy , -Ly&&+ 

~£y'(a 0 U 0 ■+<a 1 U 1 + - *a T 7 U T1 ) 
+a 0 a' 0 £U*+a 1 a' 1 ZU 1 *+ - + 

This expression may be simplified; indeed, starting from equa- 
tion (15) we have, after multiplication by a' 


4Zyff 9 -* s «' s ZU* 


Putting into this equation successively <s « Q -<2 • • • and adding 
the results, we obtain the values of the second sum of (41). To 
have the third, we start from the equation analogous to (15) 

Zy'U s = *' s ZU s z 
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and multiply both members by a s ; putting successively 
• 77 and adding the results, we obtain an expression for the third 

sum of (41). Finally this formula will be 






We have still to determine the quantity & s a' s £Uf /N; but 
according to (24) and (32) this expression is equal to 
where is the mean-orthogonal-moment of degree rn of y' - 
We conclude that, 

<«> r nm .^\Lzyy'-e' ! &J-c,6i,e;--c„%4 1 . 

r? “ 777 L J 

Sometimes, n being given, the coefficients of correlation r run 
are sought for all values of ■m = 0 J l l 2 l 77.- In this case we will 
first divide the quantity within the brackets in equation (42) by 
CT n and then divide the quotient successively by the values of 
for m = 0,I,2, rj. 

All the quantities figuring in equation (42) are known from 
the previous determination of the approximating functions f n fc) 
and fx), except ZLyy'/N. To obtain the desired coefficient of 
correlation, it is necessary to compute this additional last quantity. 

Formula (42) also shows the importance of the mean-ortho¬ 
gonal moments of the given quantities y and y '; it is independent 
of the origin, of the interval, and of the constant of the orthogonal 
polynomial chosen. 

The most important particular case oir nTT} is r 00 ; this being 
the co effici ent of correlation of the deviations of y and of y' 
from their respective averages. Thus r 00 shows the simultaneity 
of the variation from the respective averages. Another important 
particular case is , which gives the correlation between 
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and f^fah/r thus measuring the simultaneity of periodical devia¬ 
tions from the respective linear trend-lines. 

If m and m are large, the approximating parabolas will follow 
the principal secular and periodical variations, will therefore have 
nearly the same maxima and minima as the values y and y' . 
In this event the remaining deviations will be mainly due to chance, 
and thus the coefficient of correlation loses its importance. 

§ 15. Second problem of correlation . Given the functions 
and f^fa), of degree n and \J respectively, approximating the 
values of a quantity y . Let us denote by <f the deviations of the 
two corresponding parabolas: 


f - 


Moreover, let us likewise have given the functions ^ fa) and 

of degree m and^a respectively approximating the values 

of another quantity y f , and denote by r) the deviations between 

the corresponding two parabolas. 

It is required to find the coefficient of correlation r ., 

r?l/> 777ILL 

between the deviations f and 9 . According to the definition of 
th coefficient of correlation we have 


(43) [ f ^' 

where cf nl f denotes the mean-square-deviation of f , and a' z 
the mean-square-deviation of ry (§ 8 ). Both are known from 
the determination of the approximating parabolas. 

Let us suppose that 77 £ m>\t = /a.* Substituting, for f n fa), 

, f m fa) and their expansions in orthogonal poly¬ 

nomials (14), the sum in the second member of (43) will be 
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^^xl+l U y+1 + '” + *n U n)t d M+l U u+1 * ' ‘+ a m U m)- 


This yields, in consequence of the orthogonality of the poly¬ 
nomial u s , 

a V+l d j+ j + ••■+ a m a^EU ^. 


As we have seen in the preceding paragraph, 

Ti W .c,e,e;. 


Hence 

<“> Iv, m„ 

All the quantities of the second member of this equation are 
known from the previous parabolic approximation, so that the 
calculating of the coefficient (44) is easy. We note that it is a 
simple function of the mean-orthogonal moments, of the mean- 
square deviations and of the number^ . (formula 34 or table 
III). 

If m and ju are gwen and this coefficient must be computed 
for several values of n and \J , we first divide the quantity within 
the brackets of formula (44) bytfj^ and then divide the quotient 
successively by the different values of 

In the second problem also, the most important particular case 
is that of especially if n and m are large, in which case 

ft O 777 O * 
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the deviations y^fo/-yand fej-y could be considered as negligi¬ 
ble. In this case the coefficient of correlation (44) of the trend- 
lines is much more important than the coefficient of correlation 
(42) of the trend-deviations. 

Example on correlation . A. Sipos has determined trend-lines 
up to the third degree of Hungarian imports and exports in 1882- 
1913. 16 The mean orthogonal moments for imports were 

e o * 12 J4,23938 Q z * 70,63941 

e, . 206,02671 Q 3 - 21, 27341 

The mean-square-deviations corresponding to the parabolas of 
imports of degree 0, 1, 2, 3 were 

CT 0 = 383,777 o z = 83,952 

cr i = 166, 311 a 3 - 69,330 

The equation of the third degree parabola of approximation was 

f/x) =864,12484 3Q36Z(*)-2,d2064(Z)+C>43304{Z) 


where x=0 corresjjonds to 1882 and T'x) is given in millioir 

gold crowns. 

The corresponding values for exports were 


1234,4 

0^ = 37,80645 

0,'= 192,673 

OJ « 5,58515 

crj* 37,387 

efc'- 58,480 

<= 96,662 

crj- 57,184 


tJV f- f^.. Prakti ^he Anwendung der Trendberechnungsmethode von 
* der Unganschen Landeskommission fiir Wirtschafts- 

statistik und Konjunkturforschung. Budapest 1930. 
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The equation of the third degree parabola of exports was 

fs'M- 821,24103 *15, 09955(¥)+0 j 26944(^0,11947( 

First problem of correlation. Let us determine the more impor¬ 
tant particular cases of the coefficient of correlation r.' r}77}> between 
the deviations import and the deviations y - of 

export. 

The coefficient of correlation between the deviations of the given 
quantities and their respective averages is 

T-Zyy'-S><]. 

and since y 72457, 80 we have 

r 00 -O 9 9586 

This correlation is a very strong one. 

The coefficient of correlation between the deviations of the given 
quantities and their respective linear trend-lines is 

\ 5 Eyy ‘~ e ° e °~ Cieie ’ ] ~°> 7669 - 

(The number C / was taken from table III. * 2,81818 • • •) 
This correlation is still strong enough. 

The coefficient of correlation between the deviations of the given 
quantities and their respective third degree trend-lines is 

\pZyy'-e o &J- 0 3 Oj ] 

« 0,1739. 
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This correlation is already very small, the deviations are mainly 
due to chance. From Table III we had C z = 4,14436303wa.A 
C % ~ 4.80748663. 

Second problem of correlation. The coefficient of correlation 
r , between the deviations of two trend-lines of import of 

77V,TnjU # . 

degree 77 and respectively and the deviations of two trend-lines 
of export of degree 7n and ju respectively, is to be determined. 
We will only consider the most important particular case, the 
correlation between the third degree trend-lines and the respective 
averages, that is 


30.30 


-f—r [C,e f °f * c * 4 <3 1 

*jo a so L J 


where 


C,e?r C 2 C 3 of* 142659, 2209 


and 

c, e' ^ c z ef* C 3 e' 3 z * 110699, 9496. 

And therefore a 30 = 377,70296 and <?J 0 = 332, 71000. 

The quantities in the brackets have already been calculated in 
the first problem, so that we have 

r soj3o * Q 9323. 


The obtained value is very near to that of r oc calculated above, 
proving that the third degree trend-lines well represent the given 
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quantities. This would not be true in the case of the second degree 
trend-lines; in fact we should obtain ^zo^o s ^ 956^widely dif¬ 
ferent from r 20 obtained before. 

§ 16. Some- mathematical properties of the orthogonal poly¬ 
nomials. Symmetry of the polynomial U m . If we substitute in 
formula (18) a+b-h-x for we get 

u m (.*b-h-«)jCh m ECS)( *"*)( *£2) 


■Ch m 


777 )/*-& t mh-sh ) 


777-$ S 


and putting m-jjL, it follows that 


(45) U rn 7'a*b-h-x)^('-l) n ' U m (x). 


This formula shows the symmetry of the polynomial. 
Let us consider the particular case, 

X -<2 =^/ b-a-h) = kh. 


We have, then, 
b-d= (2k+l)h, 

x-a = kh 


N= 2k+i 

x-b*-(k+l)h. 


From (45) it follows that 

TJ m U+kh) = (-l) m u^fa+kh). 
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Hence we have 

From equation (45) we easily obtain 

AU m C*)= C-i) m ^AU m f*+b-2h- x) 

and 

A s l/ m C*).(-l) s U m Ca+b-sh-x). 

Function-equation of U^fx). This can be deduced in the fol¬ 
lowing way: let us develop x U m into a series of orthogonal poly¬ 
nomials ; we find that 

as. in consequence of the orthogonality of these polynomials, the 
other terms vanish; indeed, if// is different from m-l , 777 and 
777 +J, it follows that 


b 



Since equation (46) holds for every value of x ,, and U m (k) 
is known for three particular values of x , we can determine the 
coefficients A s - 

We know these values, since equation (21) gives for b 

U n ,(b).CMh”( b -*Z, J " h ) " 

17 As C can be dependent of 777, we will write Cf m ) in the following 
formulae, instead of C . 
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and for a;*a , after changing a into b and inversely 

Moreover, in consequence of the above-mentioned symmetry of 
the polynomials, we have 


The two last equations give by using formula (46) 


(47) * U m M . tk, cr„ 6J. 

and 

U„ (M) 

Multiplying both sides of equation (47) by (-1) m and adding 
it to (48), we find that 

{b+a-h) U m Cb-h)= 2A m U m fi*>-h) 

and 
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Multiplying both sides of (47) by (~l ) m and subtracting it from 
(48) yields 

Since 

U m CfmJh V 

we find that 


ifb-a-hjcfajh CMJ 


(49) 


+ A Tr,+ 1 C ( n i+Vh 


2 ('i>-a-n r ?hXb-d-mh-h) 
mfrn+JJ 


As, in consequence of the symmetry of the polynomials, we have 


u m cu. r-v m v„r«-v. cr*)h »f *•*£”*; 

hence we can deduce two other equations analogous to (47) and 
(48), i.e.. 



CHARLES JORDAN 


313 


and 

(a-h)U„M. A^U^Ce-h)* A„ U m (,-h+A^D^, (,-h). 


After multiplying the second equation by and subtracting 
the result from the first, we obtain 


ir^h) u m c». a„_ , 


or 


(.50) j('b-d+hJC('m)h — A m _ 1 C fa-1) 

Ab-a+mhXb-A+ mfi+h) 


* A m +i CfrrH-f) h* 


777+1 J 


Finally subtracting (50) from (49), and simplifying, we have 


Cfa>) 


2(27T,+1) hCfa+l) 


V\ e deduce A m/ from (49) by substituting therein the above 
value for A m ^ , yielding 

Ct>-<i) z -m 2 h 2 hC fa) 

2 fa 777+1) C fa-1) 
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Now the function-equation (46) is known. 

We might have proceeded in another way. Having obtained 
A m by using the equations (47) and (48), we could determine, 
tor instance in the usual way by multiplying both mem¬ 
bers of equation (46) by , and summing x from a to t? , 


b 

z 




A 


m+/ 


Z*d 




2 

/ • 


Since Z + is already known from (24), we need only 
determine the first member and this may be done by applying 
formula (3) to the quantity 

ZU m UVJ. 


Difference-equation of (x). We will start from 


According to equation (1 ) we have 






X-d+b ) 
777+1 


Therefore 
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Applying to this expression formula (2), giving the -m+ltk dif¬ 
ference of a product, it follows that 

( 51 ) AV m +t 

Now let us deduce a second formula for f . We can 

write this quantity in the following manner 

AVU-CA-*[(tZ)/£),Q. 

+U<mh)Cb+mhj\^(*-*J h f*m%] } ' 

Again using formula (2) to deduce th&(m+2)-ih difference of 
the preceding product, we have after simplification 

Au ^rdnP fa* +**X*-b*2h)a*u m 


+(,m+2)h('2?6+3h<r<a.-b)AU m * (n+lXm+Zjh 2 U m J 
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Finally, taking into account (51), this equation results in 
(x-dL+2h)(x-b+£h)A*U m + ^?z -a. -b+3h- h A £7^ 

(52) 

* 777(777-hi) h *17^ * O. 

This is the required difference-equation; it is a linear equation 
of the second order and can be solved by Boole's method. 18 If we 
put %=(x-a.)/h the solution will be 



This expression of V r m differs from those we obtained in par¬ 
agraph 6. 

The roots of U m 6c)=0. L. Fejer 19 has demonstrated the fol¬ 
lowing theorems concerning these roots: 

The roots of all real and single, they are all sit¬ 

uated in the interval 


<2, b-h 

Whatever % may be. in the interval a+fh, a+fh+h there is 
at most one root of 


o> 


Fejer showed moreover that if g m (x) is a polynomial of 
degree 777 , and if in its Newton expansion the coefficient of (£) 


isc °°Hi Cjjk’d 11 ® 0 * Finite Differences, 1860, p. 176. 
See the Appendix in loc. cit. 5 
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is equal to unity, the polynomial which minimizes the following 
expression 


A M 


z 


is the orthogonal polynomial U m fc), with the constant C suitably 
chosen * Indeed the first conditions of a minimum are that 


z 




for 


O = v/< m 


and these are identical with the conditions of orthogonality (14). 
The second condition of the minimum is always satisfied in these 
cases, as has been shown in § 5. 

§ 17. Graduation by orthogonal polynomials. Let us consider 
an odd number of consecutive values of x. , say 2k+i , and the 
corresponding values of y . A smoothed value of y is wanted 
for the central term, viz. for x=a+kh . This will be obtained by 
determining, according to the principle of least squares, a para¬ 
bola of degree m , so that the sum of the squares of deviations 
between the parabola and the points x t y shall be a minimum. 

The equation of the parabola expressed in orthogonal polyno¬ 
mials (13) will be 

fc)m a c + <a i Ui fe)+ - + aOc) 


and the smoothed value required is given by 7^ (a+khJ. 

In consequence of the symmetry of the polynomials, formula 


♦See Essher’s first polynomial in § 21. 
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(45), we have O, so that the equation of the 

parabola will give 

f n (a+kh)* a 0 + <3 jL£ (ct+kh)+ d+ZhCa+kh)*-* 


From this formula we see that it is useless to consider parabolas 
of odd degree, as for instance a parabola of the second degree 
will give the same smoothed value for y as would a parabola 
of the third degree. Therefore we will consider only parabolas 
of even degree. 

The values of U^fa+khjzxt given by our formulae (18), 
(21), etc., but we can obtain a much simpler formula for them, 
starting from the Function-Equation of fa) given by for¬ 
mula (46), which, since A m = a+k/? , we can write in the fol¬ 
lowing manner 


(Zk+lf- m z Um-i 
2(2 m+i) C ( 777 - 1 ) 


+ {a+kh-x) JUjEl _ 

h 3 CC777) 


+ J™12L J%22±1 

ZCZm+l) nCTrn+1) ' 

This holds for every value of as . For*- a+kh the term in 
U 2rrj will vanish and we have 


£7 / (a*kh )=- ittkUL ih±i±JZ CCrn+1) 
™ "m+i m*.1 C(m-l) 




kh). 


This equation can be solved by putting into it successively 177 = 1 ^ 3 /- 
It follows that ' 1 ‘ 
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U z (a+kh)=-k{k+l)Cf£)h A 

uja+kh), (Zx k ; z )ca)h e 


and so on 


u^khu-jrh * m cr*m)C)( % m ). 

For a ?w we have found in § 7 

_ (4 m-tl) G Zvj 
Zm Cfrn?)h ^ + 2m ) 

X 777 / 


Hence if to abbreviate we write 


(54) 


^Zm 


1) m {4 m+J) f 


sKirn ) 
X^TT? ) 1 Z 777 / 


we have 


^ " ^777 


and finally the required smoothed value of y is given by 

(55) t t „r~kh). e 0 - ^ . 
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This formula is also independent of the interval and of the con¬ 
stant of the orthogonal polynomial used. 

The values of ^ necessary up to parabolas of the tenth 
degree and up to 29 ordinates are given in Table II, so the cal¬ 
culation of the graduated value is very simple. All we need do 
is to compute the mean binomial moments by Chetrerikoff’s meth¬ 
od (§9) and calculate the corresponding mean orthogonal mo¬ 
ments & 2r71 by formula (30). This must of course be repeated 
for every value which is to be graduated. 

Example 7 . Nine point graduation, employing second and 
fourth degree parabolas. The given values are 


X 

y 

X 

y 

0 

2502 

5 

2904 

1 

2548 

6 

3064 

2 

2597 

7 

3188 

3 

2675 

8 

3309 

4 

2770 




The mean binomial moments were calculated by the method of 
§ 9, and were found to be 

T 0 - 2839, 6666 7 T s = 3182, 21429 

T t - 3014, 97222 T 4 * 3223, 37302 

T z = 3117, 36667 

Hence the mean orthogonal moments are 
T c = 2639,66667 G+ - 14 T 4 -35T B +30T Z - 10T t 
37,+T 0 -29,08335 + 1-10,33148 


From Table II, we take S z --1,81818182 and 5^ 1,13286713. 



Table 1L Graduation. 
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cK oo v© r* 

vp vo 3* 

\o to o o ov 

1—I to t>* *-« ^ cvj 

oo ir» o vo ^ cvj 

go oo *** -a* <r> oo o 

O VO ON C© tN. O 

u-> o f>. o to »o o 

O r-4 ^ ?5 po n- »o 

o o' o cT o o' o 



- 2,2413 7931 2,3426 0289 -1,8798 6652 1,2291 4349 - 0,6647 9271 2 

- 2,2580 6451 2,4017 5953 -1,9824 0469 1,3487 3583 - 0,7689 6251 1 
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Therefore the smoothed value with a parabola of the second 
degree will be 


£(4)* e o + S z G z -2706,73705 


In the paper loc. cit. 9 (p. 31) these values have been approx¬ 
imated by a parabola of the third degree, the value obtained for 
f 3 (4) was (p. 33)/?/%} 78763 according to what has been said, 
this value should be equal to the obtained results for f z 74) . 

Finally the smoothed value corresponding to a parabola of the 
fourth degree is 

f+C4)*f z 74)+S+&4 = 277^0637 


Bibliographical and Historical Notes . 20 

§ 18. It was Chebisheff who first introduced orthogonal poly¬ 
nomials with respect to a discontinuous variable. He 21 especially 
treated the case of non-equidistant variables, from the mathemat¬ 
ical point of view, in a very interesting manner, but his results 
were necessarily complicated. As we consider here only equi¬ 
distant variables, this paper will not be discussed. 

But Chebisheff also investigated the case of equidistant poly¬ 
nomials in two of his papers. In the first “Sur Interpolation par 
la Methode des Moindres Carres” 22 he denotes the orthogonal 
polynomial of degree m by <p m 7z) ; the variable x taking values 
differing by unity, from ~~ 77V-J) to £7N-j) inclusive. His 
polynomials can be obtained for instance from our formula (21') 
by putting therein h=l, 7N~4) and C7rniJ z ■ Formula 

(24) will give Z<p£ by putting into it and 

where mi stands for *• 777 . 


f jhe numbers of the formulae quoted refer to the present paper. 
**Sur les Fractions Continues. 1855. Oeuvres, T. I. p. 201. 
22 0euvres 1859. Oeuvres, Tome I. p. 474. 
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In the second paper “Interpolation des valeurs equidistantes” 
in which he introduces such polynomials, 23 he denotes the poly¬ 
nomial of degree m by cp m (x) and the variable x takes the 

values 1, 2, 3,_These polynomials can also be obtained 

from formula (21') and 71 (p^ from (24), by putting in them 
c?= 1, N=n-1 and C - {m!) z . 

§ 19. J, P . Gram utilized orthogonal polynomials for gradua¬ 
tion according to the principle of least squares. 24 He denotes the 
polynomial of degree m by , the variable z taking the 

values 


-i/'/v - 1 ), • • • •. -IQl, • • • A(n-i) 

where N is an odd number. Then writing 


he determines the coefficients by formulae 

jf/V+l) 

Z X s y/ m fe)= O tor OH s<m. 


There are 777 such equations and rn+J unknown coefficients 
one of them therefore will be arbitrary. Gram disposes of 
the first coefficient which is different from zero in such a manner 
that all the values of corresponding to the considered 

values of x shall be integers and as small as possible. 

For instance in the case of ^(x) , it follows that oc 0 = O , and 
in order to have tyfx) * x , he puts 1 - 1 

Practically he uses only polynomials of the first, second and 

231875. Oeuvres, T. II. p. 270. 219. 

24 Ueber partielle Ausgleichung mittelst Orthogonalfunktionen, Bulletin de 
PAssociation des Actuaires Suisses. 1915. 
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third degrees; and gives tables for %fx) and ^(x) available 
for N=7,9,llr--j21 an d containing the corresponding values 
of 


The values of ^ (x) can be obtained, for instance, from our 
formula (21'), if we put therein h-1, a=~z(N-l) , and either 
C=-l , if f not divisible by three, or C--j ii 
it is so divisible. 

The values of are also obtained from the same formula 

by putting into it N-l) and either C= - ^ , if 

(N-2)(/V-j) is not divisible by five, or 1 /zo , if it is so 

divisible. C-Vz corresponds to ^ fe)- * • 

Formula (24) would give the values of fZ^indZ:^ 
of the table, if we were to substitute h*l , and for C the 
respective values above-mentioned. « 

In order to give an example, Gram calculates the smoothed value 
corresponding to an eleven-point parabola of the second degree 
(p. 12). His calculation is short enough, but our general method 
is still shorter, as may be judged from the following determina¬ 
tion of the smoothed central value of his example. 

In the first column we have written the values of y corre¬ 
sponding to a reversed order of magnitude of x ; the other col¬ 
umns have been obtained by simple addition, as indicated in § 9. 


194 

194 

179 

373 

212 

585 

124 

709 

780 

1489 

000 

1489 

504 

1993 

244 

2237 

000 

2237 

582 

2819 

000 

2819 


194 

194 

567 

761 

1152 

1913 

1861 

3774 

3350 

7124 

4839 

11963 

6832 

18795 

9069 

27864 

11306 

39170 

14125 

... 
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lienee 


T 0 = Q 0 - 2819/11 = 296,272/27... 
T, = 14129/55 = 256, 81818 ... 
5- 59170/169 = W 595959... 


and 


<2, = 2T z -3T 3 + T 0 =-59, 5959... 


'I'd have the smoothed value, we take from Table II.. 
S z =-1,92507694- 


,md it follows that the required smoothed value is 


£ 7a* kh)* e o * % * 532,0309 


agreeing with Craw's result. 

Although the calculation has been executed to nine figures, it 
is a very short one. 

§ 20. Ch. Jordan “Sur une serie de polynomes dont chaque 
^omme partielle represente la meilleure approximation d’un degre 
donne suivant la methode des moindres carres.” 20 (1921) In this 
paper the author treats the mathematical theory of orthogonal 
polynomials for equidistant values in the general case. Many of 
their mathematical properties are demonstrated: formulae, differ¬ 
ence-equations, function-equations of these polynomials are given 
and some interesting propositions concerning their roots are dem¬ 
onstrated. Two particular polynomials were introduced. In the 


^Proceedings of the London Mathematical Society. Vol. XX., P. 297. 
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first, denoted by Q m (x), x takes the values <3, a+h ) a+2h J '-, b-h> 
where b = /V/ 7 ; and /V is an integer. 'Hie constant C was 

chosen as 

C= ml/2 777 h 

This was done, as has been mentioned, so that for , b*l 

and h=0 the limit of the polynomial fe )should be identical 
with Legendre*s polynomial. 

The second particular case denoted by q m (x) was obtained 
from feJ by putting h*l , a^O and b=N. There are tables 
in this paper giving the values of q w fc) for ?v=l i 2 J 3, 4, 5 f° r 
N=?v+l t m+2 J ~y 2?and for the values of X- 0,^2^ •*, TV~J. 
Moreover there is a table giving Zlqjf for 777 = i J 2 J 3^4, 7 and 

for /V* m+i\ m+2, • • * ^ 20. 

The problem of approximation was solved in this paper by 
formula (14) of the present work in the following manner:—the 
coefficients a m were calculated from formula (13)above lEyq^tx) 
was computed first by multiplying every value of y by the cor¬ 
responding values of taken from the tables mentioned 

and then the products were added. The quantity was 

taken likewise from the tables. 

The coefficients a^ being known, the mean square deviation 
was calculated by formula (32'). And finally, the required values 
of f n fe) were obtained by using formula (14), and by taking 
the necessary values of q m fe) from the tables. 

In a second paper “Berechnung der Trendlinie auf Grund der 
Theorie der kleinsten Quadrate M ■ ^<, the determination of the coeffi¬ 
cients has been much simplified. These were obtained by 
multiplying the binomial moments by certain numbers, and were 
easd^ calculated with the aid of a table of binomial coefficients 
( y) . These tables exhibit the values given for /V up to 55 and 

^Mitteilungen der Ungarischen Landeskommission fur Wirtschaftsstatistik 
und Konj unkturforschung. Budapest, 1930. 
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for \J up to ten, and are sufficient for parabolas up to the tenth 
degree. Although the calculation of was not necessary for 
the determination of a m , nevertheless it had to be evaluated 
for the determination of the mean square deviation. For this 
purpose a very simple formula was given for (p. 45). 

In this paper the method of approximation by orthogonal 
polynomials has been freed from the tables giving the values of 
these polynomials corresponding to the given tc values. These 
tables would be very voluminous if we wanted them extended 
up to one hundred observations and to parabolas up to the tenth 
degree. 

This has been attained by giving formulae which permit us to 
pass easily from the orthogonal expansion of the approximating 
parabola to its Newton series, which gives directly the required 
values by means of the method of addition of the differences and 
by calculating the coefficients a w without the evaluation of 

EyQ m fxJ. 

The third paper “Sur la determination de la tendance seculaire 
des grandeurs statistiques par la methode des moindres carres.” 27 
(1930) published somewhat later than the second, differs from 
it inasmuch as it introduces the mean orthogonal moments, giving 
(p. 585) a table for their calculation for parabolas up to the tenth 
degree,—the present table I. The coefficients a m , the quantities 

a * 

E(? w figuring in the formula of the mean square deviation, 

and those of <3m '£Q figuring in that of the different coeffi- 

yy 777 

cients of correlation, are expressed by these moments. The calcu¬ 
li • 

lation of EQ m became needless, which is very fortunate since 


27 Journal de la Sociate Hongroise de Statistique, 1929, T. VII. p. 567 
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these numbers are very large if /V is large, and it would there¬ 
fore be difficult to operate with such numbers. The table of the 
binomial coefficients has been extended sufficiently for one hun¬ 
dred observations (/V up to 105). 

Finally, in the present paper the orthogonal polynomials are 
left in the general form, the arbitrary constant remains entirely 
in the background, the coefficients <3 m are no longer calculated, 
the mean orthogonal moments alone are used, and by the aid of 
these quantities the Newton expansion of the approximating para¬ 
bola and also the mean square deviation and the coefficients of 
correlation are directly obtained. \i unnecessary matter has been 
cleared away. 

Table II. giving numbers S 2m useful for graduation, and 

Table III. rendering it easier to establish Newton's formula, are 

* 

new. Table IV. of the binomial coefficients has been extended by 
a few lines (up to 110) in order to suffice for parabolic approxi¬ 
mation of the tenth degree by the new formulae. 

§21. M . F. Esshcr “Ueber die Sterblichkeit in Schweden 
1886-1914”-* denotes in this paper the orthogonal polynomial of 
degree m by P m &)\ x taking* the values of / (FJ-l) 

Since the coefficient of .as' 77 is taken as equal to unity, therefore 
the constant C of the polynomial is, according to formula 

(21'). C s m!/\(*rn)* Putting this value into (21'), and writing 
it will gi\e the values of Esshcr’s polynomial 
corresponding to a given X . Formula (24) gives for the above 
value of C and 

r „ 2_ QzlUA . 

Lr* ■ r*%j 

In a second paper “On Graduation according to the Method of 
Least Squares by Means of Certain Polynomials” 29 , Essher has 

28 Medelanden fran Lunds Astronomiska Observatorium, Lund 1920. 

^Forsakringsaktiebolaget Skandia 1855-1930, Stockholm, 1930. p. 107. 
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employed other orthogonal polynomials, denoting them by 

the variable x taking the values of 1,2,3,. N , Adopting 

Lorenz’s point of view (§22), he chose the constant C in such 
a manner that EX^ should be equal to N . From our formula 
(24) we conclude that in this case 


dr¬ 



ill this way the expression EX^ becomes very simple, but the 
polynomials themsehes become complicated. 

Putting h-i , a-1 , and the above value of C into formula 
\2Y) we have 




777+1 

E 

l UQ 


sx- N-l )S 777+ Y) S N+ 777 ) 
' \J 7T7 777- \A' 


The coefficient a^ in an expression by Esshcr’s polynomials 
is expressed very simply by our method. Indeed from (32) we 
get. if we put in this equation h*l and the value or C above, 

a J77 “ y^T 

where is the mean orthogonal moment of degree 777 and Crr, 
the number given by formula (34), or C m *| taken from 
Table IIL 

As a comparison let us determine the graduated value corre-- 
sponding to Age 52, Essher’s example 10. (p. 116) 

The first column below contains the given values corresponding 
to x in an inverted order of magnitude; the other columns are 
obtained by the method of § 9. The graduation for the central 
value will be obtained by an eleven-point parabola of the second 
degree. 
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674 

674 

674 

674 

873 

1547 

2221 

2895 

1005 

2552 

4773 

7668 

1216 

3768 

8541 

16209 

1331 

5099 

13640 

29849 

1239 

' 6338 

19978 

49827 

1640 

7978 

27956 

77783 

1385 

9363 

37319 

115102 

1366 

10729 

48048 

163150 

1315 

12044 

60092 


851 

12895 




Let us remark, that these numbers figure in Esther's table (p. 
117). We shall have 

T 0 = Q 0 = 12695/II = 1172, 272727 ... 

T f « 60092/55 - 1092, 581813 ... 

T z = 165150/165 - ?< 9 < 9 , 7 < 570 ... 

Hence 

< 9 * = 2T 2 -5T, +T 0 =-127, 896969... 

From Table II. we take 5* =-l, 9250 7691, and finally the 
required graduated value will be 

&o + ' 5 je&*‘ 1416,22 

agreeing with Bssher's result. 

§ 22. P, Lorentz in the first edition of his paper “Der Trend*' 80 
introduced orthogonal polynomials, distinguishing two cases ac¬ 
cording as the number of observations was either even or odd. 
He denoted polynomials of degree m byX m fe) and chose them 
so that EX^ should be equal to /V. 

If 777 is odd the variable takes the values 

-yN-l-1, O, 


S0 Vierteljahreshefte zur Konjunktur for s chung. Sonderheft 9, Berlin, 1928. 
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and the value of C in U m fc) corresponding to the above con¬ 
dition, taken from (24), is 


C« 


/mm . 

/ ) X N+m \ 

\j * T ^/'-Zm + l) 


Hence is given by formula (21') by putting in it the above 
value of C and placing h=l, <?=-| (tS-l), so that we have 




N 




(/Zm ) f N+m \ 
( rn y ' 2 m + 1 J 






If m is even, the variable takes the values 

-{N-V,-fN-3), .. • -1,1,3, • •.. fN-l) 

£ 

and the value of C corresponding to the condition HX m «/Vwill 
be obtained from (24) by putting h=2, so that 


_ 1 f N 

2 Z7n / /2m )/N+m i 


The polynomial is given by (21') by placing in it this value of 
C and h*2, Hence it follows that 




Jo 


N 


(*zxy m 


[ 2m+l 


) 


d / m+ \ZyN+m )/x- N-l) 
yLg Z m / rn- '/■'{ v/ /% 
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Whether /V be odd or even, the coefficient of a m is given by 
our tormula (32), the same formula appearing in Esshcr's expan¬ 
sion, 

s jr t 

rr? rrj 777 * 

Here^is the mean orthogonal moment of degree rr? ,and£~ w is 
given by formula (34), or since C ^ r , by Table III, 

The paper contains five decimal tables giving JC m fcj corre¬ 
sponding to the necessary integer values of X up to m * 5 and 
N up to 60. There are also other tables useful for the transfor¬ 
mation of the orthogonal series into a power scries, and also a 
table enabling one to change the interval of one year into that of 
one month. 

The second edition of the paper does not differ in principle 
from the first. The polynomials remain the same, but the tables 
forA^/^have been extended for rr? up to six, and for N up 
to eighty. 

The only advantage that Lorcntz's method possesses over ours 
is that when applying Chctverikofs method to the determination 
of binomial moments the calculation in his system is a little easier, 
since the numbers to be added contain one or two figures less. 
Hut as this operation is generally made by calculating machines, 
this is but a slight advantage, and this is largely compensated for 
in the subsequent operations. 

As an example, let us determinate the coefficients corresponding 
to Lorentz’s polynomials in the orthogonal expansion of the ex¬ 
ample in § 13. There, the mean orthogonal moments found were 

G 0 * 9942, 3333 - 183 

©i - 97 1,3333 Q^- 351,666 

Q z * 36,5-3 3 3 
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We have seen that C Q =1\ the other numbers C m = \C mQ j are 
taken from Table II., their square roots being 

yJC, -- i, 463 330 109 sjc z = i, 336 306 210 

\tC 3 = O, 912 870 929 Jc z = O, 462 910 050 

JC 5 = O, 154 303 350 


Finally, the required coefficients a m are given by our formula 


a -rr7~ ®rrr]^-7n 


d 0 - 9942, 3333 3 


a,= 1421,83641 


<3, = 51, 4 9233 


a162, 79003 


a l - -177, 5 745 9 


in accordance with Lorentz’s results. 

The corresponding mean square deviations would be given by 

. — 




T77 * 


C 4 u 2 A ^7 





TABLE III 


These numbers are given to ten figures, although generally fewer 
will be sufficient, especially if as is generally the case the mean 
orthogonal moments are all of the same order of magnitude. In 
this event, a fixed number of decimals, properly chosen will suffice. 

The numbers C ms given are checked by the relation: 

rm-L 

21 = 2Tn+l 

s=0 ms 

Remark. 

lim C' n = Zm+l and lim C mr -0 if sAO. 

ms 
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7 / 


c ,i 

0 

0 * 


3 

-1,5 

1,5 

0,5 

-1,5 

4 

-1,8 

1,2 

1 

-2 

5 

-2 

1 

1,428 571 429 

-2,142 857 143 

6 

-2,142 857 143 

0,857 142 857 1 

1,785 714 286 

-2,142 857 143 

7 

-2,25 

0,75 

2,083 333 333 

-2,083 333 333 

8 

-2,333 333 333 

0,666 666 666 7 

2,333 333 333 

-2 

9 

-2,4 

0,6 

2,545 454 545 

-1,909 090 909 

10 

-2,454 545 455 

0,545 454 545 5 

2,727 272 727 

-1,818 181 818 

11 

-2,5 

0,5 

2,884 615 385 

-1,730 769 231 

12 

-2,538 461 538 

0,461 538 461 5 

3.021 978 021 

-1,648 351 648 

13 

-2,571 428 571 

0,428 571 428 6 

3,142 857 143 

-1,571 428 571 

14 

-2,6 

0.4 

3,25 

-1,5 

15 

-2,625 

0.375 

3,345 588 235 

-1,433 823 529 

16 

-2,647 058 823 

0,352 941 176 4 

3,431 372 549 

-1,372 549 020 

17 

-2.666 666 667 

0,333 333 333 3 

3,508 771 930 

-1,315 789 474 

18! 

-2.684 210 526 

0,315 789 473 6 

3,578 947 368 

-1,263 157 894 

19 

-2.7 

0,3 

3,642 857 143 

-1,214 285 714 

20 

-2,714 285 714 

0.285 714 285 7 

3,701 298 702 

-1,168 831 169 

21 

-2,727 272 727 

0,272 727 272 7 

3,754 940 711 

- 1,126 482 213 

22 

-2 739 130 436 

0.260 869 565 2 

3,804 347 826 

-1,086 956 522 

23 

-2,75 

0,25 

3,85 

-1,05 

24 

-2,76 

0,24 

3,892 307 692 

-1,015 384 615 

25 

-2,769 230 769 

0,230 769 230 8 

3,931 623 933 

-0,982 905 982 9 

26 

-2,777 777 778 

0,222 222 222 2 

3,968 253 969 

-0,952 380 952 4 

27 

-2,785 714 286 

0,214 285 714 2 

4,002 463 055 

-0,923 645 320 5 

28 

-2,793 103 447 

0,206 896 551 6 

4,034 482 758 

-0,896 551 723 9 

29 

”2,8 

0,2 

4,064 516 128 

-0,870 967 741 8 

30 

-2,806 451 614 

0,193 548 387 1 

4,092 741 935 

“0,846 774 193 4 

31 

-2,8125 

0,1875 

4,119 318 182 

-0,823 863 636 4 

32 

-2,818 181 818 

0,181 818 181 8 

4,144 385 026 

“0,802 139 037 4 

33 

-2,823 529 411 

0,176 470 588 2 

4,168 067 225 

-0,781 512 605 1 

34 

-2,828 571 429 

0,171 428 571 4 

4,190 476 192 

-0,761 904 761 9 

35 

-2,833 333 333 

0,166 666 666 7 

4,211 711 712 

-0,743 243 243 3 

36 

-2,837 837 838 

0,162 162 162 2 

4,231 863 442 

-0,725 462 304 4 

37 

-2,842 105 263 

0,157 894 736 8 

4,251 012 146 

-0,708 502 024 3 

38 

-2,846 153 846 

0,153 846 153 8 

4,269 230 769 

-0,692 307 692 3 

39 

-2,85 

0,15 

4,286 585 369 

-0,676 829 268 3 

40 

-2,853 658 537 

0,146 341 463 4 

4,303 135 888 

-0,662 020 905 9 

41 

-2,857 142 857 

0,142 857 142 9 

4,318 936 877 

-0,647 840 531 5 

42 

-2,860 465 116 

0,139 534 883 7 

4,334 038 055 

-0,634 249 471 4 

43 

-2,863 636 364 

0,136 363 636 4 

4,348 484 848 

-0,621 212 121 2 

44 

-2,866 666 667 

0,133 333 333 3 

4,362 318 840 

-0,608 695 652 1 

45 

-2,869 565 217 

0,130 434 782 6 

4,375 578 168 

-0,596 669 750 2 

46 

-2,872 340 425 

0,127 659 574 6 

4,388 297 872 

-0,585 106 382 9 

47 

-2,875 

0,125 

4,400 510 204 

-0,573 979 591 8 

48 

-2,877 551 021 

0,122 448 979 6 

4,412 244 898 

- 0,563 265 306 1 

49 

-2,88 

0,12 

4,423 529 411 

-0,552 941 176 4 

50 

-2,882 352 941 

0,117 647 058 8 

4,434 389 140 

-0,542 986 425 4 
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N 

^10 


C ZO 

c « 

51 

-2,884 61S 385 

0,115 384 615 4 

4,444 847 606 

-0,533 381 712 7 

52 

-2,886 792 453 

0,113 207 547 2 

4,454 926 625 

-0,524 109 014 7 

53 

-2,888 888 889 

0,111 111 111 1 

4,464 646 465 

-0,515 151 515 2 

54 

-2,890 909 091 

0,109 090 909 1 

4,474 025 974 

-0.506 493 506 5 

55 

-2,892 857 143 

0,107 142 857 1 

4,483 082 706 

-0,498 120 300 7 

56 

-2.894 736 842 

0,105 263 157 9 

4 491 833 031 

-0 490 018 148 9 

57 

-2,896 551 724 

0,103 448 275 9 

4,500 292 227 

-0,482 174 167 2 

58 

-2,898 305 085 

0,101 694 915 3 

4,508 474 576 

-0 474 =576 271 2 

59 

-2,9 

0,1 

4,516 393 442 

-0,467 213 114 7 

60 

-2,901 639 344 

0,098 360 655 74 

4,524 061 343 

-0,460 074 034 9 

61 

-2,903 225 806 

0,096 774 193 54 

4,531 490 014 

-0,453 149 001 5 

62 

-2,004 761 905 

0,095 238 095 24 

4,538 690 476 

-0 446 428 571 4 

63 

-2,906 250 

0,093 750 

4.545 673 077 

-0.439 903 846 2 

64 

-2,907 692 307 

0,092 307 692 30 

4,552 447 552 

- 0.433 566 433 6 

65 

-2,909 090 909 

0,090 909 090 91 

4,559 023 067 

-0,427 408 412 5 

66 

-2,910 447 761 

0,089 552 238 80 

4.565 408 253 

-0,421 422 300 3 

67 

-2,911 764 706 

0,088 235 294 12 

4,571 611 254 

-0,415 601 023 1 

68 

-2,913 043 478 

0,086 956 521 74 

4,577 639 752 

- 0 409 937 888 2 

69 

-2,914 285 714 

0,085 714 285 72 

4,583 501 007 

- 0,404 426 559 4 

70 

-2,915 492 960 

0,084 507 042 26 

4,589 201 879 

- 0,399 061 032 9 

71 

-2,916 666 667 

0,083 333 333 33 

4.594 748 858 

- 0.393 8 35 616 4 

72 

-2,917 808 219 

0,082 191 780 82 

4,600 148 094 

- 0 388 744 909 4 

73 

-2,918 918 919 

0,081 081 081 08 

4 605 405 405 

- 0.383 783 783 8 

74 

-2,92 

0,08 

4,610 526 316 

- 0,378 947 368 4 

75 

-2,921 052 632 

0,078 947 368 42 

4,615 516 062 

- 0,374 231 032 1 

76 

-2,922 077 922 

0,077 922 077 92 

4,620 379 620 j 

- 0.369 630 369 6 

77 

-2,923 076 923 

| 0,076 923 076 92 ! 

4,625 121 713 

- 0,365 141 187 9 

78 

-2,924 050 633 

0,075 949 367 08 

4,629 746 835 

- 0,360 759 493 7 

79 

-2,925 

0,075 

4,634 259 260 

- 0,356 481 481 5 

80 

-2,925 925 926 

0,074 074 074 07 

4,638 663 052 

- 0,352 303 523 0 

81 

-2,926 829 268 

0,073 170 731 70 

4,642 962 091 

-0,348 222 156 8 

82 

-2,927 710 843 

0,072 289 156 62 

4,647 160 068 

- 0,344 234 079 1 

83 

-2,928 571 428 

0,071 428 571 42 

4 651 260 504 

- 0,340 336 1.34 4 

84 

-2,929 411 763 

0,070 588 235 30 

4,655 266 758 

- 0 336 525 307 8 

85 

-2,930 232 558 

0,069 767 441 86 

4,659 182 037 

- 0,332 798 716 9 

86 

-2.931 034 483 

0,068 965 517 24 

4 663 009 403 

- 0 329 153 604 9 

87 

-2,931 818 182 

0,068 181 818 18 

4,666 751 789 

- 0,325 587 334 1 

85 

1 -2,932 584 270 

0,067 415 730 34 

4,670 413 986 

- 0,322 097 378 4 

85 

> -2,933 333 333 

0,066 666 666 67 

4,673 992 674 

-0,318 681 318 7 

9C 

) -2,934 065 934 

0,065 934 065 93 

4,677 496 416 

-0,315 336 837 0 

91 

[ -2,934 782 608 

0,065 217 391 30 

4,680 925 664 

-0,312 061 711 0 

92 

! -2,935 483 871 

0,064 516 129 04 

4,684 282 773 

- 0,308 853 809 2 

92 

$ -2,936 170 213 

0,063 829 787 24 

4,687 569 990 

-0,305 711 086 3 

9 L 

1 -2,936 842 105 

0,063 157 894 74 

4,690 789 473 

~ 0,302 631 578 9 

9! 

5 -2,937 5 

0,062 5 

4,693 943 301 

-0,299 613 402 2 

9( 

S -2,938 144 330 

0,061 855 670 10 

4,697 033 453 

- 0,296 654 744 5 

9! 

7 -2,938 775 510 

0,061 224 489 80 

4,700 061 841 

-0,293 753 865 1 

91 

5 -2,939 393 939 

0,060 606 060 61 

4,703 030 303 

- 0,290 909 090 9 

9! 

? -2,94 

0,06 

4,705 940 594 

-0,288 118 811 9 

10i 

ass 

0 -2,940 594 060 

0,059 405 940 59 

4,708 794 409 

-0,285 381 479 3 
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N f«_ C so ' O/ | 


3 3 

4 2 - 0.2 0.8 -2 

S 1,428 571 429 - 0,5 1,5 - 2.5 


6 1,071 428 572 - 0,833 333 333 3 

7 0,833 333 333 3 -1,166 666 667 

8 0,666 666 666 7 -1,484 848 485 

9 0,545 454 545 5 -1,781 818 182 

10 0,454 545 454 5 -2.055 944 056 

11 0,384 615 384 6 - 2.307 692 308 

12 0,329 670 329 7 -2,538 461 538 

13 0,285 714 285 7 - 2,75 

14 0,25 - 2,944 117 648 

15 0,220 588 235 2 - 3,122 549 020 

16 0,196 078 43 1 4 -3,286 893 705 

17 0,175 438 596 5 - 3,438 596 491 

18 0,157 894 736 8 - 3,578 947 369 

19 0,142 857 142 9 - 3,709 090 909 

20 0,129 870 129 9 -3,830 039 526 

21 0.118 577 075 1 -3,942 687 747 

22 0,108 695 652 2 -4,047 826 087 

23 0,1 -4.146 153 846 

24 0,092 307 692 28 -4,238 290 598 

25 0,085 470 085 50 -4.324 786 325 

26 0.079 365 079 38 -4.406 130 268 

27 0,073 891 625 64 -4,482 758 621 

28 0,068 965 517 22 -4,555 061 179 

29 0,064 516 129 02 - 4,623 387 098 

30 0,060 483 870 96 -4,688 049 852 

31 0,056 818 181 82 - 4,749 331 550 

32 0,053 475 935 83 - 4,807 486 633 

33 0,050 420 168 07 - 4,862 745 098 

34 0,047 619 047 62 - 4,915 315 315 

35 0.045 045 045 05 -4,965 386 439 

36 0,042 674 253 20 - 5,013 130 540 

37 0,040 485 829 96 - 5,058 704 454 

38 0,038 461 538 46 -5,102 251 407 

39 0,036 585 365 86 -5.143 902 439 

40 0,034 843 205 57 -5,183 777 652 

41 0,033 222 591 36 -5,221 987 315 

♦2 0,031 712 473 57 -5,258 632 840 

13 0,030 303 030 30 -5,293 807 641 

M 0,028 985 507 24 -5,327 597 903 

15 0,027 752 081 14 -5,360 083 256 

16 0,026 595 744 68 -5,391 337 386 

17 0,025 510 204 08 -5,421 428 571 

18 0,024 489 795 92 -5,450 420 168 

19 0,023 529 411 76 -5,478 371 040 

30 0,022 624 434 39 -5,505 335 952 


2 -2.5 

2.333 333 333 -2,333 333 333 

2,545 454 545 - 2,121 212 121 

2,672 727 273 -1,909 090 909 

2,741 258 742 -1,713 286 714 

2,769 230 769 -1.538 461 538 

2,769 230 769 -1.384 615 384 

2,75 -1,25 

2.717 647 060 -1 1.32 352 941 

2.676 470 588 -1,029 411 765 

2.629 514 964 -0,9.39 112 487 0 

2,578 947 .368 -0,859 649 122 8 

2,526 315 790 - 0,789 473 684 2 

2.472 727 273 -0.727 272 727 3 

2,418 972 332 - 0,671 936 758 9 

2,365 612 648 - 0,622 529 644 .3 

2,31.3 04.3 478 - 0,578 260 869 6 

2.261 538 461 -0.5.38 461 5.38 4 

2,211 282 051 -0,502 564 102 5 

2.162 393 16.3 - 0.470 085 470 1 

2.114 942 528 - 0,440 61.3 026 8 

2.068 965 517 -0,41.3 79.3 10.3 4 

2,024 471 635 -0.389 .321 468 2 

1,981 451 613 - 0,366 935 484 0 

1,9.39 882 697 - 0,346 407 624 5 

1,899 732 620 - 0,327 540 106 9 

1,860 962 568 - 0,310 160 427 9 

1,823 529 412 - 0,294 117 647 1 

1,787 387 387 - 0,279 279 279 .3 

1,752 489 332 - 0,265 528 686 6 

1,718 787 614 - 0,252 762 884 4 

1,686 234 818 - 0,240 890 688 2 

1,654 784 240 - 0,229 831 144 5 

1,624 390 244 - 0,219 512 195 1 

1,595 008 508 - 0,209 869 540 6 

1,566 596 194 -0,200 845 666 0 

1,539 112 051 -0,192 389 006 4 

1,512 516 469 -0,184 453 227 9 

1,486 771 508 -0,176 996 608 1 

1,461 840 888 -0,169 981 498 6 

1,437 689 970 -0,16.3 37.3 860 2 

1,414 285 714 -0,157 142 857 1 

1,391 596 639 -0,151 260 504 2 

1,369 592 760 -0,145 701 357 4 

1,348 245 539 -0,140 442 243 7 
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N 

C'ZZ 

^30 

^31 

51 

0,021 770 682 15 

-5,531 365 909 

1,327 527 818 

52 

0,020 964 360 59 

- 5,556 508 480 

1,307 413 760 

53 

0,020 202 020 20 

- 5,580 808 081 

1,287 878 788 

54 

0,019 480 519 48 

-5,604 306 221 

1,268 899 522 

55 

0,018 796 992 48 

- 5,627 041 743 

1,250 453 721 

56 

0018 148 820 33 

- 5,649 051 031 

1,232 520 225 

57 

0,017 533 606 08 

- 5,670 368 206 

1,215 078 901 

58 

0,016 949 152 54 

-5,691 025 285 

1,198 110 586 

59 

0.016 393 442 62 

-5,711 052 353 

1,181 597 038 

60 

0,015 864 621 89 

-5,730 477 702 

1,165 520 888 

61 

0,015 360 983 10 

-5,749 327 957 

1,149 865 591 

62 

0,014 880 952 38 

-5,767 628 207 

1,134 615 385 

63 

0,014 423 076 92 

- 5,785 402 099 

1,119 755 245 

64 

0,013 986 013 99 

- 5,802 671 956 

1,105 270 849 

65 

0,013 568 521 03 

- 5,819 458 856 

1,091 148 535 

66 

0,013 169 446 88 

- 5,835 782 722 

1,077 375 272 

67 

0,012 787 723 79 

- 5,851 662 406 

1,063 938 619 

68 

0,012 422 360 25 

-5,867 115 739 

1,050 826 699 

69 

0,012 072 434 61 

- 5,882 159 623 

1,038 028 169 

70 

0,011 737 089 20 

- 5,896 810 084 

1,025 532 189 

71 

0,011 415 525 11 

-5,911 082 314 

1,013 328 397 

72 

0.011 106 997 41 

-5,924 990 742 

1,001 406 886 

73 

0,010 810 810 81 

-5,938 549 075 

0,989 758 179 3 

74 

0.010 526 315 79 

-5,951 770 335 

0,978 373 205 7 

75 

0,010 252 904 99 

-5,%4 066 912 

0,967 243 283 1 

76 

0,009 990 009 990 

- 5,977 250 597 

0,956 360 095 6 

77 

0,009 737 098 344 

-5 989 532 620 

0,945 715 676 8 

78 

0,009 493 670 886 

1 -0,001 523 676 

0,935 302 391 0 

79 

0,009 259 259 259 

-6,013 233 96<> 

0,925 112 917 8 

80 

0,009 033 423 666 

-6,024 673 219 

0,915 140 235 7 

81 

0,008 815 750 806 

-6,035 850 720 

0,905 377 608 0 

82 

0,008 605 851 978 

-6,046 775 336 

0,895 818 568 4 

83 

0,008 403 361 344 

-6,057 455 540 

0,886 456 908 3 

84 

0,008 207 934 343 

- 6,067 899 429 

0,877 286 664 4 

85 

0,008 019 246 192 

-6,078 114 748 

0,868 302 106 9 

86 

0,007 836 990 594 

-6,088 108 908 

0,859 497 728 2 

87 

0,007 660 878 450 

-6,097 889 003 

0,850 868 232 9 

88 

0,007 490 636 706 

-0,107 461 827 

0,842 408 527 8 

89 

0,007 326 007 326 

-6,116 833 891 

0,834 113 712 4 

90 

0,007 166 746 296 

-6,126 011 436 

f ',825 979 070 0 

91 

0,007 012 622 718 

-0,135 000 448 

0,818 000 059 7 

92 

0,006 8o3 417 982 

-6,143 806 668 

0,810 172 307 9 

93 

0,006 718 924 974 

-6,152 435 610 

0,802 491 601 3 

94 

0,006 578 947 368 

-6,160 892 566 

0,794 953 879 5 

95 

1 0,006 443 298 972 

-6,169 182 621 

0,787 555 228 2 

96 

0,006 311 803 074 

-6,177 310 662 

0,780 291 873 1 

97 

0,006 184 291 896 

-6,185 281 385 

0,773 160 173 2 

98 

0,006 060 606 060 

-6,193 099 310 

0,766 156 615 7 

99 

0,005 940 594 059 

- 6,200 768 783 

0,759 277 810 1 

100 

0,005 824 111 823 

-6,208 293 988 

0,752 520 483 4 



-0,135 4(52 022 3 

- 0,130 741 376 0 
-0,126 262 626 3 
-0,122 009 569 4 
-0,117 967 332 1 
-0,114 122 243 1 
-0,110 461 718 3 
-0,106 974 159 5 
-0,103 648 863 0 
-0,100 475 938 7 
-0,097 446 236 
-0,094 551 282 
-0,091 783 216 
-0,089 134 745 
-0,086 599 090 
-0,084 169 943 
-0,081 841 432 
-0,079 608 083 
-0,077 464 788 
-0,075 406 778 
-0,073 429 593 
-0,071 529 063 
-0,069 701 280 

- 0,067 942 583 

- 0,066 249 539 
-0,064 618 925 
-0,063 047 711 
-0,061 533 052 
-0,060 072 267 

- 0,058 662 835 
-0,057 302 380 
-0,055 988 660 
-0,054 719 562 

- 0,053 493 089 

- 0,052 307 355 
-0,051 160 579 
-0,050 051 072 
-0,048 977 239 
-0,047 937 569 

- 0,046 930 628 
-0,045 955 059 
-0,045 009 572 
-0,044 092 945 
-0,043 204 015 
-0,042 341 678 

- 0,041 504 886 

- 0,040 692 640 
-0,039 903 990 

- 0,039 138 031 45 
-0,038 393 902 22 
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"77 


<Uo 

c 

41 


C 




4 

5 

4 

2,5 




0,071 

428 

571 43 

-0,357 

142 

857 2 

1,071 

428 

571 


6 

1,666 

666 

667 


0,214 

285 

714 3 

-0,857 

142 

857 2 

1,928 

571 

429 


7 

1.166 

666 

667 


0,409 

090 

909 1 

-1,363 

636 

364 

2,454 

545 

455 


8 

0,848 

484 

848 

5 

0,636 

363 

636 4 

-1,818 

181 

818 

2.727 

272 

727 


9 

0,636 

363 

636 

4 

0,881 

118 

881 1 

-2,202 

797 

203 

2,832 

167 

832 


10 

0,489 

510 

489 

6 

1,132 

867 

133 

-2,517 

482 

517 

2,832 

167 

832 


11 

0.384 

615 

384 

6 

1,384 

615 

385 

-2,769 

230 

769 

2,769 

230 

769 


12 

0,307 

692 

307 

6 

1,631 

868 

132 

-2,967 

032 

967 

2,670 

329 

670 


13 

0,25 




1,871 

848 

739 

-3,119 

747 

899 

2,552 

521 

008 


14 

0.205 

882 

353 

0 

2,102 

941 

177 

-3,235 

294 

118 

2,426 

470 

589 


15 

0,171 

568 

627 

5 

2,324 

303 

405 

-3,320 

433 

436 

2,298 

761 

610 


16 

0,144 

478 

'844 

2 

2,535 

603 

715 

-3,380 

804 

953 

2,173 

374 

613 


17 

0,122 

807 

017 

5 

2,736 

842 

104 

-3,421 

052 

630 

2,052 

631 

578 


18 

0,105 

263 

157 

9 

2,928 

229 

666 

-3,444 

976 

077 

1,937 

799 

043 


19 

0,090 

909 

090 

91 

3,110 

107 

283 

-3,455 

674 

7S9 

1,829 

474 

872 


20 

0,079 

051 

383 

40 

3,282 

891 

022 

-3,455 

674 

760 

1,727 

837 

380 


21 

0,069 

169 

960 

48 

3,447 

035 

573 

-3,447 

035 

573 

1,632 

806 

324 


22 

0.060 

869 

565 

22 

3,603 

010 

033 

-3,431 

438 

127 

3,544 

147 

157 


23 

0,053 

846 

153 

84 

3,751 

282 

051 

-3,410 

256 

410 

1,461 

538 

461 


24 

0,047 

863 

247 

86 

3,892 

307 

693 

-3,384 

615 

385 

1,384 

615 

385 


25 

0,042 

735 

042 

74 

4,026 

525 

199 

-3,355 

437 

666 

1,312 

997 

347 


26 

0,038 

314 

176 

24 

4,154 

351 

390 

-3,3 23 

481 

117 

1,246 

305 

419 


27 

0,034 

482 

758 

62 

4,276 

179 

883 

-3,289 

369 

141 

1,184 

172 

891 


28 

0,031 

145 

717 

46 

4,392 

380 

423 

-3,253 

615 

128 

1,126 

251 

391 


29 

0.028 

225 

806 

46 

4,503 

299 

121 

-3,216 

642 

229 

1,072 

214 

076 


30 

0,025 

659 

824 

04 

4,609 

259 

101 

-3,178 

799 

380 

1,021 

756 

943 


31 

0,023 

395 

721 

92 

4,710 

561 

498 

-3,140 

374 

332 

0,974 

598 

930 

5 

32 

0,021 

390 

374 

34 

4,807 

486 

632 

-3,101 

604 

278 

0,930 

481 

283 

5 

33 

0,019 

607 

843 

14 

4,900 

295 

253 

-3,062 

684 

533 

0,889 

166 

477 

4 

34 

0,018 

018 

018 

02 

4,989 

229 

831 

-3,023 

775 

655 

0,850 

436 

903 

0 

35 

0,016 

595 

542 

91 

5,074 

515 

.814 

-2,985 

009 

302 

0,814 

093 

446 

0 

36 

0,015 

318 

962 

69 

5,156 

362 

841 

-2,946 

493 

052 

0,779 

954 

043 

1 

37 

0,014 

170 

040 

49 

5,234 

965 

933 

-2,908 

314 

407 

0,747 

852 

276 

1 

38 

0,013 

133 

208 

26 

5,310 

506 

567 

-2,870 

544 

090 

0,717 

636 

022 

5 

39 

0,012 

195 

121 

95 

5,383 

153 

716 

-2,833 

238 

798 

0,689 

166 

194 

0 

40 

0,011 

344 

299 

49 

5,453 

064 

803 

-2,796 

443 

488 

0,662 

315 

563 

1 

41 

0,010 

570 

824 

52 

5,520 

386 

590 

-2,760 

193 

295 

0,636 

967 

683 

5 

42 

0,009 

866 

102 

890 

5,585 

255 

997 

-2,724 

515 

121 

0,613 

015 

902 

2 

43 

0,009 

222 

661 

396 

5,647 

800 

858 

-2,689 

428 

980 

0,590 

362 

459 

0 

44 

0,008 

633 

980 

882 

5,708 

140 

610 

-2,654 

949 

121 

0,568 

917 

668 

8 

45 

0,008 

094 

357 

076 

5,766 

386 

943 

-2,621 

084 

974 

0,548 

599 

180 

6 

46 

0,007 

598 

784 

194 

5,822 

644 

377 

-2,587 

841 

945 

0,529 

331 

307 

0 

47 

0,007 

142 

857 

142 

5,877 

010 

804 

-2,555 

222 

089 

0,511 

044 

417 

7 

48 

0,006 

722 

689 

076 

5,929 

577 

985 

-2,523 

224 

675 

0,493 

674 

392 

9 

49 

0,006 

334 

841 

628 

5,980 

431 

999 

-2,491 

846 

666 

0,477 

162 

127 

6 

50 

0,005 

976 

265 

688 

6,029 

653 

662 

-2,461 

083 

127 

0,461 

453 

086 

4 
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^41 

^42 

-2,430 927 563 

0,446 496 899 3 

- 2,401 372 213 

0,432 246 998 3 

-2,372 408 293 

0,418 660 287 1 

-2,344 026 210 

0,405 696 844 1 

-2,316 215 730 

0,393 319 652 2 

-2,288 966 132 

0,381 494 355 4 

-2,262 266 338 

0,370 189 037 1 

-2,236 105 011 

0,359 374 019 7 

-2,210 470 651 

0,349 021 681 7 

-2,185 351 666 

0,339 106 293 1 

-2,160 736 440 

0,329 603 863 8 

-2,136 613 387 

0,320 492 008 1 

-2,112 970 984 

0,311 749 817 2 


0,303 357 748 5 
0,295 297 518 5 
0,287 552 009 8 
0,280 105 183 5 
0,272 941 999 8 
0,266 048 344 9 
0,259 410 964 1 
0,253 017 401 0 
0,246 855 940 3 
0,240 915 556 7 
0,235 185 £66 8 
0,229 657 085 1 
0,224 319 983 8 
0,219 165 855 2 
0,214 186 477 3 
0,209 374 081 7 
0,204 721 324 3 
0,200 221 258 1 
0,195 867 307 4 
0,191 653 245 1 
0,187 573 170 2 
0,183 621 488 3 
0,179 792 892 1 
0,176 082 344 6 
0,172 485 062 6 
0,168 996 501 4 
0,165 612 341 2 
0,162 328 473 4 
0,159 140 989 0 
0,156 046 166 7 
0,153 040 462 0 
0,150 120 498 1 
0,147 283 055 7 
0,144 525 064 8 
















































































AM 


approxmatton by orthogonal, polynomials 


'44 


5-2,5 

6 -3 

7 -2,863 636 36 4 

8 -2,545 454 545 

9 - 2,202 797 203 

10 - 1,888 111 888 

11 -1,615 384 615 

12 -1.384 615 385 

13 -1,191 176 471 

14 -1,029 411 765 

15 -0,893 962 848 2 

16 -0,780 185 758 4 

17 -0,684 210 526 1 

18 -0,602 870 813 5 

19 -0,533 596 837 7 

20 -0,474 308 300 4 

21 -0,423 320 158 1 

22 -0,379 264 214 0 

23 - 0,341 025 641 0 

24 -0,307 692 307 7 

25 - 0,278 514 588 8 

26 -0,252 873 563 3 

27 -0.230 255 839 9 

28 -0,210 233 592 9 

29 -0.192 448 680 4 
•30 -0,176 599 965 5 

31 -0,162 433 155 1 

32 -0,149 732 620 3 

33 -0.138 314 785 4 

34 -0,128 022 759 6 

35 -0,118 721 960 9 

36 -0,110 296 531 4 

37 -0,102 646 390 8 

38 -0,095 684 803 00 

39 -0,089 336 358 49 

40 -0,083 535 296 24 

41 -0,078 224 101 48 

42 -0,073 352 330 17 

43 -0,068 875 620 22 

44 -0,064 754 856 61 

45 -0,060 955 464 51 

46 -0,057 446 808 51 

47 -0,054 201 680 67 

48 -0,051 195 862 96 

49 -0,048 407 752 07 
50| -0,045 818 036 94 


- 0.023 809 523 

- 0,083 333 333 

- 0,179 487 179 

- 0,307 692 307 

- 0,461 538 461 

- 0,634 615 384 

- 0,821 266 968 
-1,016 806 723 

- 1,217 492 260 
-1,420 407 637 
-1,623 323 013 
-1,824 561 403 

- 2,022 883 295 

- 2217 391 304 

- 2,407 453 416 

- 2,592 642 141 

- 2,772 686 734 

- 2,947 435 897 

- 3.116 828 765 
-3,280 872 384 

- 3,439 624 274 

- 3,593 178 929 

- 3.741 657 397 

- 3,885 199 241 

- 4,023 956 357 

- 4,158 088 235 

- 4,287 758 346 
-4,413 131 398 

- 4,534 371 270 

- 4,651 639 494 
-4,765 094 116 

- 4,874 888 909 
-4,981 172 826 

- 5,084 089 620 
-5,183 777 652 
-5,280 369 782 
-5,373 993 360 
-5,464 770 274 

- 5,552 817 057 

- 5,638 245 011 
-5,721 160 378 
-5,801 664 512 

- 5,879 854 060 
' 5,955 821 167 
‘6,029 653 660 


1,142 857 142 
1.416 666 666 
1,769 230 769 
1.153 846 154 
1.538 461 538 
,903 846 154 
1,239 819 005 
,542 016 807 
.809 597 523 
,043 730 650 
.246 646 027 
,421 052 631 
,569 794 051 
,695 652 173 
,801 242 236 
888 963 211 
960 981 ‘048 
019 230 768 
065 428 824 
101 090 480 
127 549 129 
145 975 687 

57 397 108 
62 713 473 
62 713 472 

58 088 235 
49 443 561 
37 310 686 
22 155 700 
34 387 789 
14 366 385 
)2 407 424 
>8 788 778 
3 754 963 
7 521 270 
0 277 336 

2 190 263 

3 407 339 

4 058 412 
1 257 962 
I 106 919 
! 694 247 
I 098 336 
1 388 229 

624 690 
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APPROXIMATION BY ORTHOGONAL POLYNOMIALS 


-0,5 

-1,166 666 667 
-1,794 871 795 
-2,307 692 308 
-2,692 307 692 
111-2,961 538 461 
-3,135 746 607 
-3,235 294 117 
-3,277 863 777 
-3,277 863 777 
-3,246 646 027 
-3,192 982 456 | 
-3,123 569 794 j 
-3,043 478 260 
-2,956 521 740 
-2,865 551 840 
-2,772 686 734 
-2,679 487 179 
-2,587 091 070 

25 -2,496 315 945 

26 -2,407 736 992 

27 -2,321 746 385 

28 -2,238 598 443 

29 -2,158 444 023 

30 -2,081 356 736 

31 -2,007 352 941 

32 -1,936 406 995 

33 -1,868 462 890 

34 -1,803 443 119 

35 -1,741 255 425 

36 -1,681 797 923 

37 -1,624 962 970 

38 -1,570 640 080 

39 -1,518 718 094 

40 -1,469 086 784 

41 -1,421 638 018 

42 -1,376 266 592 

43 -1*332 870 799 

44 -1,291 352 804 

45 -1,251 618 871 

46 -1,213 579 474 

47 -1,177 149 321 

48 -1,142 247 320 

49 -1,108 796 494 

50 -1,076 723 868 


1.333 333 333 

2.333 333 333 
2,871 794 872 
3,076 923 077 
3,076 923 077 
2,961 538 461 
2,787 330 317 
2,588 235 294 
2,383 900 929 
2,185 242 518 
1,997 936 016 
1,824 561 403 
1,665 903 890 j 
1,521 739 130 
1,391 304 348 
1,273 578 596 
1,167 447 046 
1,071 794 872 
0,985 558 502 8 
0,907 751 252 6 
0,837 473 736 3 
0,773 915 461 6 
0,716 351 501 7 
0,664 136 622 4 
0,616 698 292 2 
0,573 529 411 7 
0,534 181 240 0 
0.498 256 770 8 
0,465 404 675 9 
0,435 313 856 4 
0,407 708 587 5 
0,382 344 228 2 
0,359 003 446 9 
0,337 492 909 8 
0,317 640 385 7 
0,299 292 214 3 
0,282 311 095 8 
0,266 574 159 7 
0,251 971 278 8 
0,238 403 594 5 
0,225 782 227 8 
0,214 027 149 3 
0,203 066 190 2 
0,192 834 172 8 
0.183 272 147 7 


-3 

-3,5 

-3,230 769 230 
-2,769 230 769 
-2,307 692 308 
-1,903 846 154 
-1,567 873 304 
-1,294 117 647 
-1,072 755 418 
-0,893 962 848 4 
-0,749 226 006 1 
-0,631 578 947 3 
-0,535 469 107 6 
-0,456 521 739 1 
-0,391 304 347 9 
-0,337 123 745 9 
-0,291 861 761 4 
-0,253 846 153 8 
-0,221 750 663 1 
-0,194 518 125 5 
-0,171 301 446 1 
-0,151 418 242 5 
-0,134 315 906 6 
-0,119 544 592 0 
-0,106 736 242 9 
-0,095 588 235 29 
-0,085 850 566 43 
-0,077 315 705 81 
-0,069 810 701 38 
-0,063 190 721 08 
-0,057 334 020 11 
-0,052 137 849 29 
-0,047 515 162 09 
-0,043 391 945 55 
-0,039 705 048 21 
-0,036 400 404 45 
-0,033 431 577 14 
-0 030 758 556 89 
-0,028 346 768 87 
-0 026 166 248 18 
-0 024 190 952 97 
-0 022 398 190 05 
-0,020 768 133 09 
"0 019 283 417 28 
-0017 928 797 06 


6 

3,5 

2,153 846 
1,384 615 
0,923 076 
0,634 615 
0,447 963 
0,323 529 
0,238 390 
0,178 792 
0,136 222 
0,105 263 
0,082 379 
0,065 217 
0,052 173 
0,042 140 
0,034 336 
0,028 205 
0,023 342 
0,019 451 
0,016 314 
0,013 765 
0,011 679 
0,009 962 
0,008 538 
0,007 352 
0,006 359 
0,005 522 
0,004 814 
0,004 212 
0,003 698 
0,003 258 
0,002 879 
0,002 552 
0,002 268 
0.002 022 
0,001 807 
0,001 618 
0,001 45 * 
0,001 308 
0,001 180 
0,001 060 
0 000 965 
0,000 876 
0 000 7% 


154 
385 
923 1 
384 5 
801 0 
411 7 
092 9 
569 7 
910 2 
157 9 
862 71 
391 29 
913 05 
468 24 
677 82 
128 20 
175 07 
812 55 
423 43 
294 77 
644 05 
049 336 
899 431 
941 176 
300 476 
550 415 
531 130 
714 739 
969 040 
615 581 
700 793 
467 385 
859 898 
244 692 
112 278 
871 415 
680 455 
312 409 
046 487 
580 478 
959 678 4 
518 967 5 
8)5 424 9 
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APPROXIMATION BY ORTHOGONAL POLYNOMIALS 


N 

£>60 

C 61 

C 6Z 

^63 

7 

0,007 S7S 757 576- 

-0,053 030 303 Q4 

0,212 121 212 1 

- 0,636 

363 

636 

4 

8 

0,030 303 030 303- 

0,181 818 181 8 

0,606 060 606 1 

- 1,454 

545 

455 


9 

0,072 727 272 73 

-0,381 818 181 8 

1,090 909 091 

- 2,181 

818 

182 


10 

0,136 363 636 4 

-0,636 363 636 4 

1,590 909 091 

- 2,727 

272 

727 


11 

0.220 588 235 3 

0,926 470 588 2 

2,058 823 529 

- 3,088 

235 

294 


12 

0,323 529 411 7 

1,235 294 118 

2,470 588 235 

- 3,294 

117 

647 


13 

0,442 724 458 2 

- 1,549 535 604 

2,817 337 461 

- 3,380 

804 

953 


14 

0,575 541 795 6 

1,859 442 724 

3,099 071 207 

- 3,380 

804 

953 


13 

0,719 427 244 5 

- 2,158 281 734 

3,320 433 436 

- 3,320 

433 

436 


16 

0.872 033 023 9 

-2,441 692 467 

3,488 132 095 

- 3,219 

814 

242 


17 

1,031 273 836 

-2,707 093 821 

3,609 458 427 

- 3,093 

821 

509 


18 

1,195 340 129 

- 2,953 193 260 

3,691 491 575 

- 2,953 

193 

2o0 


19 

1,362 687 747 

-3,179 604 743 

3,740 711 462 

- 2,805 

53? 

597 


20 

1,532 015 810 

- 3,386 561 265 

3,762 845 830 

- 2,656 

126 

482 


21 

1.702 239 789 

-3,574 703 557 

3,762 845 850 

- 2.508 

563 

900 


22 

1,872 463 768 

-3,744 927 537 

3,744 927 537 

- 2,365 

217 

392 


23 

2,041 954 023 

-3,898 275 863 

3,712 643 679 

- 2,227 

586 

207 


24 

22210 114 943 

-4,035 862 069 

3,668 965 518 

i 

rsj 

o 

£ 

551 

724 


25 1 

2,376 467 681 

-4,158 818 441 

3,616 363 862 

- 1,972 

562 

106 


26 

2,540 631 565 

-4,268 261 030 

3,556 884 191 

- 1,855 

765 

6(n 


27 

2,702 308 120 

-4,365 266 963 

3,492 213 570 

- 1,746 

106 

785 


28 

2,861 267 421 

-4,450 860 433 

3,423 738 795 

- 1,643 

394 

621 


29 

3,017 336 553 

-4,526 004 830 

3,352 596 170 

- 1,547 

352 

079 


30 

3,170 389 857 

-4,591 599 103 

3,279 713 645 

- 1,457 

650 

509 


31 

3,320 340 729 

-4,648 4 77 020 

3,205 846 220 

- 1,373 

934 

095 


32 

3,467 134 739 

-4,697 408 357 

3,131 605 571 

- 1,295 

836 

788 


33 

3,610 743 870 

-4,739 101 330 

3,057 484 729 

- 1,22 2 

993 

892 


34 

3,751 161 688 

-4,774 205 785 

2,983 878 615 

-1,155 

049 

787 


35 

3.888 399 310 

-4,803 316 795 

2,911 101 088 

- 1,091 

662 

908 


36 

4,022 482 045 

-4,826 978 454 

2,839 399 091 

- 1,032 

508 

760 

2 

37 

4,153 446 577 

-4,845 687 673 

2,768 964 385 

- 0,977 

281 

547 

6 

38 

4,281 338 627 

-4,859 897 901 

2,699 943 279 

-0,925 

694 

838 

4 

39 

4,406 211 004 

-4,870 022 689 

2,632 444 697 

- 0,877 

481 

565 

5 

40 

4,528 121 979 

-4,876 439 055 

2,566 546 871 

- 0,832 

393 

579 

7 

41 

4,647 133 947 

-4,879 490 645 

2,502 302 895 

- 0,790 

200 

914 

1 

42 

4,763 312 297 

-4,879 490 645 

2,439 745 323 

- 0,750 

690 

868 

5 

43 

4,876 724 494 

-4,876 724 494 

2^378 889 997 

- 0,713 

666 

999 

1 

44 

4,987 439 320 

-4,871 452 359 

2,319 739 218 

-0,678 

948 

063 

9 

45 

5,095 526 240 

-4,863 911 411 

2,262 284 377 

-0,646 

366 

964 

9 

46 

5,201 054 890 

-4,854 317 898 

2,206 508 135 

-0,615 

769 

712 

2 

47 

5,304 094 656 

-4,842 869 034 

2,152 386 237 

-0,587 

014 

428 

4 

48 

1 5,404 714 339 

-4,829 744 728 

2,099 889 012 

-0,559 

970 

403 

3 

4S 

> 5,502 981 872 

-4,815 109 138 

2,048 982 612 

-0,534 

517 

203 

1 

SC 

> 5,598 964 115 

-4,799 112 098 

1,999 630 041 

-0,510 

543 

840 

2 
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948 000 3 
635 329 1 
518 772 8 
517 974 1 
558 714 8 
572 408 8 
495 634 0 
269 707 6 

840 296 2 
157 059 8 
173 326 8 
845 797 9 
134 273 0 
001 405 4 
412 475 1 
335 180 8 
739 451 7 

597 273 7 
882 531 8 
570 864 0 
639 529 2 
067 285 0 
834 275 9 
921 930 8 
312 868 9 
990 812 7 
940 509 2 
147 655 8 

598 833 4 
281 444 2 
183 654 6 
294 341 6 
603 045 2 
099 922 6 
775 706 9 
621 668 2 
629 578 8 
791 679 4 
100 648 8 
549 576 0 
131 933 4 

841 552 6 
672 601 6 
619 564 3 
677 220 3 
840 626 8 
105 101 83 
466 208 58 













































































348 APPROXIMATION BY ORTHOGONAL POLYNOMIALS 


N 

C&4 

C 65 

C66 

^70 

7 

1,590 

909 

091 


3,5 




7,0 








8 

2,727 

272 

727 


-4,0 




4,0 




-0,002 

331 

002 

331 

9 

3,272 

727 

273 


-3,6 




2,4 




- 0,010 

489 

510 

49 

10 

3,409 

090 

909 


-3,0 




1,5 




- 0,027 

766 

351 

30 

11 

3,308 

823 

529 


- 2,426 

470 

588 


0,970 

588 

235 

2 

- 0,056 

561 

085 

97 

12 

3,088 

235 

294 


- 1,941 

176 

470 


0,047 

058 

823 

5 

- 0,098 

237 

67S 

65 

13 

2,817 

337 

461 


- 1,549 

535 

604 


0,442 

724 

458 

2 

- 0,153 

250 

774 

0 

14 

2,535 

603 

715 


- 1,239 

628 

483 


0,309 

907 

120 

7 

- 0,221 

362 

229 

1 

15 

2,263 

931 

888 


-0,996 

130 

030 

9 

0,221 

362 

229 

1 

- 0,301 

857 

585 

2 

16 

2,012 

383 

901 


-0,804 

953 

560 

5 

0,160 

990 

712 

1 

- 0,393 

727 

285 

0 

17 

1,784 

897 

025 


-0,654 

462 

242 

3 

0,118 

993 

135 

0 

- 0,495 

804 

729 

2 

18 

1,582 

067 

818 


- 0,535 

469 

107 

6 

0,089 

244 

851 

26 

-0,606 

864 

988 

6 

19 

1,402 

766 

798 


- 0,440 

869 

565 

2 

0,067 

826 

086 

95 

- 0,725 

691 

699 

5 

20 

1,245 

059 

289 


- 0,365 

217 

391 

3 

0,052 

173 

913 

05 

- 0,851 

119 

894 

6 

21 

1,106 

719 

368 


-0,304 

347 

826 

1 

0,040 

579 

710 

14 

- 0,982 

061 

416 

9 

22 

0,985 

507 

246 

5 

- 0,255 

072 

463 

8 

0,031 

884 

057 

98 

- 1,117 

518 

164 


23 

0,879 

310 

344 

9 

- 0,214 

942 

528 

8 

0,025 

287 

356 

33 

- 1,256 

587 

091 


24 

0,786 

206 

896 

7 

- 0,182 

068 

965 

5 

0,020 

229 

885 

06 

- 1,398 

459 

827 


25 

0,704 

486 

466 

6 

- 0,154 

987 

022 

6 

0,016 

314 

423 

44 

- 1,542 

418 

927 


26 

0,632 

647 

385 

8 

- 0,132 

554 

690 

4 

0,013 

255 

469 

04 

- 1,687 

832 

159 


27 

0,569 

382 

647 

3 

- 0,113 

876 

529 

5 

0,010 

845 

383 

76 

- 1,834 

145 

783 


28 

0,513 

560 

819 

2 

- 0,098 

246 

417 

59 

0,008 

931 

492 

508 

-1,980 

877 

446 


29 

0,464 

205 

623 

6 

- 0,085 

104 

364 

32 

0,007 

400 

379 

507 

-2,127 

609 

108 


30 

0,420 

476 

108 

3 

- 0,074 

003 

795 

07 

0,006 

166 

982 

922 

- 2,273 

980 

250 


31 

0,381 

648 

359 

6 

- 0,064 

586 

645 

47 

0,005 

166 

931 

638 

-2,419 

681 

502 


32 

0,347 

099 

139 

7 

“ 0,056 

564 

304 

24 

0,004 

351 

100 

326 

- 2,564 

448 

772 


33 

0,316 

291 

523 

7 

-0,049 

702 

953 

72 

0,003 

681 

700 

276 

-2,708 

057 

903 


34 

0,288 

762 

446 

6 

-0,043 

812 

233 

28 

0,003 

129 

445 

235 

- 2,850 

319 

857 


35 

0,264 

111 

993 

9 

-0,038 

736 

425 

77 

0,002 

671 

477 

639 

- 2,991 

076 

393 


36 

0,241 

994 

240 

7 

- 0,034 

347 

569 

64 

0,002 

289 

837 

976 

- 3,130 

196 

225 


37 

0,222 

109 

442 

6 

- 0,030 

540 

048 

36 

0,001 

970 

325 

701 

- 3,267 

571 

608 


38 

0,204 

197 

390 

8 

-0,027 

226 

318 

78 

0,001 

701 

644 

923 

- 3,403 

115 

319 


39 

0,188 

031 

764 

0 

- 0,024 

333 

522 

41 

0,001 

474 

758 

934 

- 3,536 

757 

996 


40 

0,173 

415 

329 

1 

- 0,021 

800 

784 

23 

0,001 

282 

399 

072 

-3,668 

445 

794 


41 

0,160 

175 

861 

0 

- 0,019 

5 77 

049 

67 

0,001 

118 

688 

553 

- 3,798 

138 

322 


42 

0,148 

162 

671 

4 

- 0,017 

619 

344 

71 

0,000 

978 

852 

483 8 

- 3,925 

806 

837 


43 

0,137 

243 

653 

7 

- 0,015 

891 

370 

42 

0,000 

858 

992 

995 9 

- 4,051 

432 

657 


44 

0,127 

302 

762 

0 

- 0,014 

362 

362 

89 

0,000 

755 

913 

836 4 

- 4,175 

005 

764 


45 

0,118 

237 

859 

4 

- 0,013 

006 

164 

54 

0,000 

666 

982 

796 8 

-4,296 

523 

605 


46 

0,109 

958 

877 

2 \ 

- 0,011 

800 

464 

87 

0,000 

590 

023 

243 4 

- 4,415 

990 

001 


47 

0,102 

386 

237 

5; 

- 0,010 

726 

177 

26 

0,000 

523 

228 

159 1 

-4,533 

414 

237 


48 

0,095 

449 

500 

56 

-0,009 

766 

925 

638 

0,000 

465 

091 

69 7 \ 

-4,648 

810 

235 


49 

0,089 

086 

200 

52 

“ 0,008 

908 

620 

052 

0,000 

414 

354 

421 0 

-4,762 

195 

851 


SO 

0,083 

240 

843 

52 

- 0,008 

139 

104 

699 

0,000 

369 

959 

304 5 

-4,873 

592 

245 








CHARLES JORDAN 349 


N 

^64 

^ 6 ? 

Q6 

C 70 

51 

0,077 864 

042 

61 

- 0,007 

447 

864 

945 


0,000 

331 

016 

219 

8 

- 4,983 

023 

346 

52 

0,072 911 

770 

17 

- 0,006 

825 

782 

740 


0,000 

296 

773 

162 

6 

- 5,090 

515 

375 

53 

0,068 344 

710 

13 

- 0,006 

264 

931 

761 


0,000 

266 

592 

840 

9 

- 5,196 

096 

435 

54 

0,064 127 

696 

11 

- 0,005 

758 

405 

365 


0,000 

239 

933 

556 

9 

- 5,299 

796 

149 

55 

0,060 229 

222 

77 

- 0,005 

300 

171 

603 


0,000 

216 

333 

534 

8 

- 5,401 

645 

355 

56 

0,056 621 

020 

50 

- 0,004 

884 

950 

788 


0,000 

195 

398 

031 

5 

- 5,501 

675 

824 

57 

0,053 277 

684 

05 

- 0,004 

508 

111 

727 


0,000 

176 

788 

695 

2 

- 5,599 

920 

035 

58 

0,050 176 - 

348 

27 

- 0,004 

165 

583 

630 


0,000 

160 

214 

755 

0 

- 5,696 

410 

966 

59 

0,047 296 

404 

03 

- 0,003 

853 

781 

069 


0,000 

145 

425 

700 

7 

- 5,791 

181 

915 

60 

0,044 619 

249 

08 

1 - 0,003 

569 

539 

927 


0,000 

132 

205 

182 

5 

- 5,884 

266 

354 

61 

0,042 128 

069 

32 

- 0,003 

310 

062 

589 


0,000 

120 

365 

912 

3 

- 5,975 

697 

797 

62 

0,039 807 

646 

28 

- 0,003 

072 

870 

941 


0,000 

109 

745 

390 

7 

- 6,065 

509 

681 

63 

0,037 644 

187 

24 

- 0,002 

855 

765 

929 


0,000 

100 

202 

313 

3 

- 6,153 

735 

274 

64 

0,035 625 

175 

68 

- 0,002 

656 

792 

763 


0,(100 

091 

613 

543 

54 

- 6,240 

407 

602 

65 

0,033 739 

238 

74 

- 0,002 

474 

210 

841 


0,000 

083 

871 

553 

94 

- 6,325 

559 

363 

66 

i 0,031 976 

029 

94 

- 0,002 

306 

4(>7 

733 


0 000 

076 

882 

257 

78 

- 6,409 

222 

878 

67 

0 , 030-326 

125 

20 1 

- 0,002 

152 

176 

627 


0.000 

070 

563 

168 

10 

- 6.491 

430 

043 

68 

0,028 780 ' 

930 

52 

- 0,002 

010 

096 

735 


0 000 

064 

841 

830 

15 

- 6,572 

212 

283 

69 

0,027 332 

599 

82 

- 0,001 

879 

116 

238 


0,000 

059 

654 

483 

73 

- 6,051 

600 

525 

70 

0,025 973 

961 

82 

- 0,001 

758 

237 

415 


0.000 

054 

944 

919 

23 

- 6.729 

625 

164 

71 

0,024 698 

454 

77 1 

- 0,001 

646 

563 

651 


0,000 

050 

6(;3 

496 

95 

- 6,806 

316 

049 

72 

0,023 500 

068 

20 

- 0,001 

543 

288 

061 


0,000 

046 

766 

304 

88 

- 6.881 

702 

461 

73 

0,022 373 

290 

86 

- 0,001 

447 

683 

526 


0.000 

043 

214 

433 

63 

- 6,955 

813 

102 

74 

0,021 313 

064 

02 

- 0,001 

359 

093 

937 


0,000 

039 

973 

351 

10 

- 7,028 

676 

091 

75 

0,020 314 

739 

67 

- 0,001 

276 

926 

493 


0,000 

037 

012 

362 

13 

- 7,100 

318 

949 

76 

0,019 374 

042 

99 

- 0,001 

200 

644 

918 


0,000 

034 

304 

140 

51 

- 7,170 

768 

605 

77 

0,018 487 

038 

61 

- 0,001 

129 

763 

471 


0,000 

031 

824 

323 

12 

- 7,240 

051 

394 

78 

0,017 650 

100 

25 

- 0,001 

063 

841 

659 


0,000 

029 

551 

157 

19 

- 7.308 

193 

054 

79 

0,016 859 

883 

34 

- 0,001 

002 

479 

550 


0,000 

027 

465 

193 

15 

- 7,375 

218 

737 

80 

0,016 113 

300 

42 

- 0,000 

945 

313 

624 

6 

0,000 

025 

549 

016 

88 

- 7,441 

153 

002 

81 

0,015 407 

498 

82 

- 0,000 

892 

013 

089 

6 

0,000 

023 

787 

015 

72 

- 7,506 

019 

842 

82 

0,014 739 

840 

54 

- 0,000 

842 

276 

602 

2 

0,000 

022 

165 

173 

74 

- 7,569 

842 

663 

83 

0,014 107 

884 

04 

- 0,000 

795 

829 

355 

9 

0,000 

020 

670 

892 

36 

- 7,632 

644 

320 

84 

0,013 509 

367 

74 

- 0,000 

752 

420 

481 

8 

0,000 

019 

292 

832 

87 

- 7,694 

447 

109 

85 

0,012 942 

195 

08 

- 0,000 

711 

820 

729 

4 

0,000 

018 

020 

777 

96 

- 7,755 

272 

780 

86 

0,012 404 

420 

87 

- 0,000 

673 

820 

393 

0 

0,000 

016 

845 

509 

83 

- 7,815 

142 

541 

87 

0,011 894 

239 

05 

- 0,000 

638 

227 

461 

1 

0,000 

015 

758 

702 

74 

- 7,874 

077 

094 

88 

0,011 409 

971 

37 

- 0,000 

604 

865 

952 

2 

0,000 

014 

752 

828 

10 

- 7,932 

096 

611 

89 

0,010 950 

057 

25 

- 0,000 

573 

574 

427 

4 

0,000 

013 

821 

070 

54 

- 7,989 

220 

756 

90 

0,010 513 

044 

42 

- 0,000 

544 

204 

652 

3 

0,000 

012 

957 

253 

63 

- 8,045 

468 

731 

91 

0,010 097 

580 

47 

- 0,000 

510 

620 

395 

9 

0,000 

012 

155 

774 

02 

- 8,100 

859 

228 

92 

0,009 702 

405 

050 

- 0,000 

490 

696 

347 

4 

0,000 

Oil 

411 

542 

96 

- 8,155 

410 

463 

93 

0,009 326 

342 

835 

- 0,000 

466 

317 

141 

7 

0,000 

010 

719 

934 

29 

- 8,209 

140 

228 

94 

0,008 968 

297 

029 

- 0,000 

443 

376 

482 

3 

0,000 

010 

076 

738 

23 

- 8,262 

065 

838 

95 

0,008 627 

243 

429 

- 0,000 

421 

776 

345 

4 

0,000 

009 

478 

120 

122 

- 8,314 

204 

185 

96 

0,008 302 

225 

008 

- 0,000 

401 

426 

264 

i ! 

0,000 

008 

920 

583 

647 

- 8,365 

571 

731 

97 

0,007 992 

346 

922 

- 0,000 

382 

242 

678 

9 

0,000 

008 

400 

937 

997 

- 8,416 

184 

525 

98 

0,007 696 

771 

961 

- 0,000 

364 

148 

350 

9 

0,000 

007 

916 

268 

497 

- 8,466 

058 

210 

99 

0,007 414 

716 

343 

- 0,000 

347 

071 

828 

8 

0,000 

007 

463 

910 

297 

- 8,515 

208 

040 

100 

0,007 145 

445 

857 

- 0,000 

330 

946 

966 

0 

0,000 

007 

041 

424 

809 

- 8,563 

648 

883 









350 APPROXIMATION BY ORTHOGONAL POLYNOMIALS 


N 

C?f 

<*72 

O 3 

c 74 . 

8 

0,018 648 018 65 

- 0,083 916 083 92 

0,279 720 279 7 

- 0,769 230 769 2 

9 

0,073 426 573 42 

- 0,283 216 783 2 

0,786 713 286 6 

- 1,730 769 231 

10 

0,172 768 408 1 

- 0,583 093 377 2 

1,388 317 565 

- 2,545 248 869 

11 

0,316 742 081 4 

- 0,950 226 244 3 

1,979 638 009 

- 3,110 859 728 

12 

0,500 119 076 0 

- 1,350 321 505 

2,500 595 380 

- 3,438 318 648 

13 

0,715 170 278 6 

- 1,755 417 957 

2,925 696 594 

- 3,575 851 393 

14 

0,953 560 371 5 

- 2,145 510 836 

3,250 773 994 

- 3,575 851 393 

15 

1,207 430 341 

- 2,507 739 938 

3,482 972 136 

- 3,482 972 136 

16 

1,469 915 197 

- 2,834 836 452 

3,634 405 707 

- 3,331 538 565 

17 

1,735 316 552 

- 3,123 569 794 

3,718 535 469 

- 3,146 453 089 

18 

1,999 084 668 

- 3,373 455 378 

3,748 283 753 

- 2,945 080 092 

19 

2,257 707 510 

- 3,585 770 751 

3,735 177 865 

- 2,739 130 434 

20 

2,508 563 900 

- 3,762 845 850 

3,689 064 559 

- 2,536 231 884 

21 

2,749 771 967 

- 3,907 570 690 

3,618 121 010 

- 2,341 137 124 

22 

2,980 048 437 

- 4,023 065 390 

3,529 004 729 

- 2,156 614 001 

23 

3,198 585 323 

- 4,112 466 843 

3,427 055 703 

- 1,984 084 881 

24 

3,404 945 666 

- 4,178 796 954 

3,316 505 519 

- 1,824 078 035 

25 

3,598 977 496 

- 4,224 886 626 

3,200 671 687 

- 1,676 542 312 

26 

3,780 744 036 

- 4,253 337 041 

3,082 128 290 

- 1,541 064 145 

27 

3,950 467 841 

- 4,266 505 268 

2.962 850 880 

- 1,417 015 638 

28 

4,108 486 555 

- 4,266 505 268 

2,844 .136 845 

- 1,303 654 388 

29 

4,255 218 217 

- 4,255 218 217 

2,727 703 985 

- 1,200 189 753 

30 

4,391 134 276 

- 4,234 308 052 

2,613 770 403 

- 1,105 825 940 

31 

4,516 738 804 

- 4,205 239 577 

2,503 118 796 

- 1,019 789 139 

32 

4,632 552 620 

- 4,169 297 358 

2,396 147 907 

- 0,941 343 820 6 

33 

4,739 101 330 

- 4,127 004 384 

2,293 113 547 

- 0,869 801 690 2 

34 

4,836 906 423 

- 4,081 139 795 

2,194 161 180 

- 0,804 525 766 0 

35 

4,926 478 765 

- 4,030 755 353 

2,099 351 746 

- 0,744 931 264 8 

36 

5,008 313 960 

- 3,977 190 498 

2,008 682 070 

- 0,690 484 461 4 

37 

5,082 889 168 

- 3,921 085 930 

1,922 100 946 

- 0,640 700 315 3 

38 

5,150 661 023 

- 3,862 995 767 

1,839 521 794 

- 0,595 139 404 0 

39 

5,212 064 416 

- 3,803 398 357 

1,760 832 573 

- 0,553 404 522 9 

40 

5,267 511 909 

- 3,742 705 830 

1,685 903 527 

- 0,515 137 188 9 

41 

5,317 393 651 

- 3,681 272 528 

1,614 593 214 

- 0,480 014 198 8 

42 

5,362 077 631 

- 3,619 402 401 

1,546 753 163 

- 0,447 744 336 6 

43 

5,401 910 209 

- 3,557 355 503 

1,482 231 460 

- 0,418 065 283 5 

44 

5,437 216 809 

- 3,495 353 663 

1,420 875 473 

- 0,390 740 755 0 

45 

5,468 302 770 

- 3,433 58 S 460 

1,362 533 913 

* 0,365 557 879 0 

46 

5,495 454 ' 224 

- 3,372 210 546 

1,307 058 351 

- 0,342 324 806 3 

47 

5,518 939 071 

- 3,311 363 442 

1,254 304 334 

- 0,320 868 550 6 

48 

5,539 007 940 

- 3,251 156 834 

1,204 132 161 

- 0,301 033 040 2 

49 

5,555 895 159 

- 3,191 684 453 

1,156 407 411 

- 0,282 677 367 0 

50 

5,569 819 708 

- 3,133 023 586 

1,111 001 272 

- 0,265 674 217 1 











351 


CHARLES JORDAN 


N 

^71 




51 

5,580 986 148 

- 3,075 

237 

265 

1,067 

790 

717 


- 0,249 

908 

465 

7 

52 

5,589 585 510 

- 3,018 

376 

176 

1,026 

658 

563 


- 0,235 

275 

920 

7 

53 

5,595 796 161 

- 2,962 

480 

320 

0,987 

493 

440 

1 

- 0,221 

682 

200 

8 

54 

5,599 784 610 

- 2,907 

580 

471 

0,950 

189 

696 

3 

- 0,209 

041 

733 

2 

55 

5,601 706 294 

. 2,853 

699 

433 

0,914 

647 

254 

1 

- 0,197 

276 

858 

7 

56 

5,601 706 294 

- 2,800 

853 

147 

0,880 

771 

429 

8 

- 0,186 

317 

033 

2 

57 

5,599 920 035 

- 2,749 

051 

653 

0,848 

472 

732 

5 

- 0,176 

098 

114 

3 

58 

5,596 473 931 

- 2,698 

299 

931 

0,817 

666 

645 

8 

- 0,166 

561 

724 

1 

59 

5,591 485 987 

- 2,648 

598 

625 

0,788 

273 

400 

4 

- 0,157 

654 

680 

1 

60 

5,585 066 369 

- 2,599 

944 

689 

0,760 

217 

745 

4 

- 0,149 

328 

485 

7 

61 

5,577 317 944 

- 2,552 

331 

941 

0,733 

428 

718 

5 

- 0,141 

538 

875 

5 

62 

5,568 336 756 

- 2,505 

751 

540 

0,707 

839 

418 

1 

- 0,134 

245 

406 

9 

63 

5,558 212 506 

- 2,460 

192 

421 

0,683 

386 

783 

5 

- 0,127 

411 

095 

2 

64 

5,547 028 980 

- 2,415 

641 

652 

0,660 

on 

380 

4 

- 0,121 

002 

086 

4 

65 

5,534 864 443 

- 2,372 

084 

761 

0,637 

657 

193 

9 

- 0,114 

987 

362 

8 

66 

5,521 792 018 

- 2,329 

506 

007 

0,616 

271 

430 

5 

- 0,109 

338 

479 

6 

67 

5,507 880 036 

- 2,287 

888 

630 

0,595 

804 

330 

8 

- 0,104 

029 

327 

6 

68 

5,493 192 356 

- 2,247 

215 

055 

0,576 

208 

988 

4 

- 0,099 

035 

919 

88 

69 

5,477 788 667 

- 2,207 

467 

075 

0,557 

441 

180 

5 

- 0,094 

336 

199 

78 

70 

5,461 724 771 

- 2,168 

626 

012 

0,539 

459 

207 

0 

- 0,089 

909 

867 

83 

71 

5,445 052 839 

- 2,130 

672 

850 

0,522 

223 

737 

8 

- 0,085 

738 

225 

60 

72 

5,427 821 659 

- 2,093 

588 

354 

0,505 

697 

670 

1 

- 0,081 

804 

034 

87 

73 

5,410 076 857 

- 2,057 

353 

171 

0,489 

845 

993 

1 

- 0,078 

091 

390 

31 

74 

5,391 861 111 

- 2,021 

947 

917 

0,474 

635 

6(>1 

2 

- 0,074 

585 

603 

90 

75 

5,373 214 340 

- 1,987 

353 

249 

0,460 

035 

474 

5 

- 0,071 

273 

101 

65 

76 

5,354 173 892 

- 1,953 

549 

933 

0,446 

015 

966 

5 

- 0,068 

141 

328 

22 

77 

5,334 774 712 

i - 1,920 

518 

896 

0,432 

549 

300 

9 

- 0,065 

178 

661 

79 

78 

5,315 049 494 

- 1,888 

241 

268 

0,419 

609 

170 

6 

- 0,062 

374 

336 

17 

79 

5,295 028 837 

- 1,856 

698 

423 

0,407 

170 

706 

9 

- 0,059 

718 

370 

34 

80 

5,274 741 368 

- 1,825 

872 

012 

0,395 

210 

392 

2 

- 0,057 

201 

504 

14 

81 

5,254 213 889 

- 1,795 

743 

987 

0,383 

705 

980 

2 

- 0,054 

815 

140 

03 

82 

5,233 471 471 

- 1,766 

296 

621 

0,372 

636 

418 

0 

*M 

in 

O 

o 

1 

551 

289 

72 

83 

5,212 537 584 

- 1,737 

512 

528 

0,361 

981 

776 

7 

- 0,050 

402 

525 

87 

84 

5,191 434 194 

- 1,709 

374 

674 

0,351 

723 

183 

9 

- 0,048 

361 

937 

79 

85 

5,170 181 853 

- 1,681 

866 

386 

0,341 

842 

761 

4 

- 0,046 

423 

091 

05 

86 

5,148 799 792 

- 1,654 

971 

362 

0,332 

323 

566 

6 

- 0,044 

579 

990 

64 

87 

5,127 306 014 

- 1,628 

673 

675 

0,323 

149 

538 

7 

- 0,042 

827 

047 

30 

88 

5,105 717 359 

- 1,602 

957 

776 

0,314 

305 

446 

2 

- 0,041 

159 

046 

52 

89 

5,084 049 580 

- 1,577 

808 

490 

0,305 

776 

839 

2 

- 0,039 

571 

120 

37 

90 

5,062 317 404 

- 1,553 

211 

022 

0,297 

550 

004 

2 

- 0,038 

058 

721 

46 

91 

5,040 534 631 

- 1,529 

150 

955 

0,289 

611 

923 

4 

- 0,036 

617 

599 

51 

92 

5,018 714 131 

- 1,505 

614 

239 

0,281 

950 

232 

1 

- 0,035 

243 

779 

01 

93 

4,996 867 965 

- 1,482 

587 

198 

0,274 

553 

184 

9 

- 0,033 

933 

539 

70 

94 

4,975 007 387 

- 1,460 

056 

516 

0,267 

409 

618 

2 

- 0,032 

683 

397 

79 

95 

4,953 142 919 

- 1,438 

009 

235 

0,260 

508 

919 

3 

- 0,031 

490 

089 

15 

96 

4,931 284 389 

- 1,416 

432 

750 

0,253 

840 

994 

6 

- 0,030 

350 

553 

71 

97 

4,909 440 973 

- 1,395 

314 

803 

0,247 

396 

241 

6 

- 0,029 

261 

921 

05 

98 

4,887 621 234 

- 1,374 

643 

472 

0,241 

165 

521 

5 

- 0,028 

221 

497 

19 

99 

4,865 833 166 

- 1,354 

407 

170 

0,235 

140 

133 

7 

- 0,027 

226 

752 

32 

100 

4,844 084 216 

- 1,334 

594 

631 

0,229 

311 

792 

3 

- 0,026 

275 

309 

53 
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8 1,846 

9 3.115 

10 3,605 

11 3,733 

12 3,536 

13 3,218 

14 2,860 

15 2,507 

16 2.180 

17 1.887 

18 1,631 

19 1.408 

20 1,217 

21 1,053 

22 0,913 

23 0,793 

24 0,691 

25 0.603 

26 0.528 

27 0,463 

28 0.408 

29 0,360 

30 0.318 

31 0,282 

32 0,251 

33 0.223 

34 0.199 

35 0,178 

36 0,160 

37 0,144 

38 0,129 

39 0,117 

40 0,105 

41 0,096 

42 0,087 

43 0,079 

44 0,072 

45 0,065 

46 0,060 

47 0,055 

48 0,050 

49 0,046 

50 0,042 


153 846 
384 615 
158 371 
031 674 
556 323 
266 254 
681 114 
739 938 
643 424 
871 853 
121 282 
605 652 
391 304 
511 706 
389 459 1 
633 952 2 
229 571 3 
555 232 3 
364 849 8 
750 572 6 
100 503 9 
056 926 0 
477 870 6 
403 146 2 
025 018 8 
663 291 8 
744 328 1 
783 503 6 
370 584 6 
157 570 9 
848 597 2 
191 546 0 
971 078 8 
002 839 75 
128 627 67 
212 369 51 
136 754 77 
800 418 23 
115 575 74 
006 037 25 
405 532 31 
256 296 42 
507 874 74 


- 4,0 

- 4,5 

- 3.970 588 

- 3.235 294 

- 2,554 179 

- 1,992 260 

- 1.549 535 

- 1,207 430 

- 0,944 945 

- 0,743 707 

- 0.589 016 

- 0.469 565 

- 0.376 811 

- 0,304 347 
- 0,247 376 

- 0.202 298 

- 0,166 407 

- 0.137 652 

- 0.114 479 

- 0,095 694 

- 0.080 383 
- 0,067 836 
- 0,057 502 
- 0.048 949 

- 0,041 837 
- 0,035 896 

- 0,030 912 
- 0,026 714 
- 0,023 164 
- 0,020 151 
- 0,017 583 
- 0,015 388 
- 0.013 506 
- 0,011 886 
- 0,010 487 
- 0,009 277 
- 0,008 226 
- 0,007 311 
- 0.006 512 
- 0,005 813 
- 0,005 200 
- 0,004 661 
- 0.004 186 


235 
118 
567 
062 
004 
341 
483 9 
093 8 
018 4 
217 3 
594 2 
820 1 
311 9 
850 6 
119 0 

947 7 
050 8 

502 60 
432 59 
812 15 

948 86 
878 67 

503 14 
577 69 
812 68 
776 39 
640 00 
058 30 
664 21 
788 87 
117 89 
065 87 
705 18 
124 357 
121 158 
157 581 
520 705 
646 213 
570 794 
487 236 
381 603 


8,0 

4.5 

2.647 058 
1,617 647 
1,021 671 
0,604 086 
0,442 724 
0,301 857 
0,209 987 
0.148 741 
0.107 093 
0,078 200 
0,057 971 
0,043 478 
0,032 983 
0,025 287 
0,019 577 
0,015 294 
0,012 050 
0.009 509 
0.007 655 
0,006 166 
0,005 ()()() 
0,004 079 
0.003 347 
0,002 761 
0,002 289 
0,001 908 
0,001 597 
0,001 343 
0.001 134 
0,000 961 
0,000 818 
0,000 699 
0,000 599 
0,000 515 
0,000 444 
0,000 384 
0,000 333 
0,000 290 
0,000 253 
0,000 221 
0,000 194 


! 824 
' 059 
827 

• 687 3 
458 2 
1 585 2 
885 3 
418 8 
821 5 
869 56 
014 50 
260 87 
508 25 
356 32 
308 12 
771 97 
426 40 
456 260 
565 009 
982 923 
256 422 
156 556 
000 251 
275 207 
837 976 
198 314 
561 379 
403 887 
429 949 
799 304 5 
552 599 6 
180 345 5 
297 439 0 
395 797 6 
655 197 7 
797 767 4 
975 420 8 
682 310 7 
686 380 2 
975 582 7 
715 423 4 
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N 


51 

0,039 

116 

107 

67 

52 

0,036 

042 

268 

70 

53 

0,033 

252 

330 

13 

54 

0,030 

716 

336 

31 

55 

0,028 

407 

867 

66 

56 

0,026 

303 

581 

16 

57 

0,024 

382 

815 

83 

58 

0,022 

627 

253 

09 

59 

0,021 

020 

624 

01 

60 

0,019 

548 

456 

31 

61 

0,018 

197 

855 

42 

62 

0,016 

957 

314 

55 

63 

0.015 

816 

549 

75 

64 

0,014 

766 

356 

31 

65 

0,013 

798 

483 

54 

60 

0,012 

90 S 

525 

46 

67 

0,012 

080 

825 

14 

68 

0,011 

318 

390 

84 

69 

0,010 

612 

822 

48 

70 

0,009 

959 

246 

898 

71 

0,009 

353 

260 

975 

72 

0,008 

790 

881 

359 

73 

0.008 

268 

500 

140 

74 

0,007 

782 

845 

624 

75 

0,007 

330 

947 

598 

76 

0,006 

910 

106 

524 

77 

0.006 

517 

866 

179 

78 j 

0.006 

151 

989 

321 

79 

0,005 

810 

436 

033 

80 

0,005 

491 

344 

397 

81 

0,005 

193 

013 

266 

82 

0,004 

913 

886 

831 

83 

0,004 

652 

540 

849 

84 

0,004 

407 

670 

279 

85 

0,004 

178 

078 

195 

86 

0,003 

962 

665 

835 

87 

0,003 

760 

423 

665 

88 

0,003 

570 

423 

313 

89 

0,003 

391 

810 

318 

90 

0,003 

223 

797 

583 

91 

0,003 

065 

659 

494 

92 

0,002 

916 

726 

539 

93 

0,002 

776 

380 

521 

94 

0,002 

644 

050 

158 

95 

0,002 

519 

207 

132 

96 

0,002 

401 

362 

491 

97 

0,002 

290 

063 

387 

98 

0,002 

184 

890 

105 

99 

0.002 

085 

453 

369 

100 

0,001 

991 

391 

880 


_ ^76 

- 0,003 766 736 205 
- 0,003 395 286 182 

- 0,003 065 817 671 
- 0,002 773 002 583 
- 0,002 512 260 405 
- 0,002 279 643 701 
- 0,002 071 742 521 
- 0,001 885 604 424 
- 0,001 718 667 372 
- 0.001 568 703 284 
- 0,001 433 770 427 
- 0,001 312 173 150 
- 0.001 202 427 759 
- 0,001 103 233 517 
- 0.001 013 447 944 
- 0,000 932 065 727.8 
- 0,000 858 200 693 1 
- 0.000 791 070 327 8 
- 0,000 729 982 498 3 
- 0,000 674 324 008 7 
- 0.000 623 550 731 7 
- 0,000 577 179 079 1 
- 0,000 534 778 516 0 
- 0,000 495 905 652 5 
- 0,000 460 307 675 3 
- 0.000 427 768 499 1 
- 0,000 397 804 039 1 
- 0,000 370 258 616 5 
- 0.000 344 911 72 S 0 
- 0,000 321 565 212 5 
- 0.000 300 040 766 5 
- 0.000 280 177 757 9 
- 0.000 261 831 303 2 
- 0.000 244 870 571 1 

- 0,000 229 177 284 9 
- 0,000 214 644 399 4 
- 0,000 201 174 928 6 
- 0,000 188 680 906 8 
- 0.000 177 082 466 4 
- 0,000 166 307 018 2 
- 0.000 156 288 523 2 
- 0.000 146 966 841 1 
- 0.000 138 2 S 7 152 4 
- 0,000 130 199 439 6 
- 0.000 122 658 025 1 
- 0,000 115 621 157 0 
- 0,000 109 050 637 5 
- 0,000 102 911 490 5 
- 0,000 097 171 662 36 
- 0.000 091 801 753 35 



0,000 171 215 286 1 

0,000 150 901 608 1 

0,000 133 296 420 5 

0,000 118 000 109 9 
0,000 104 677 516 9 
0,000 093 046 681 66 
0,000 082 869 700 85 
0,000 073 945 271 54 
0,000 066 102 591 23 
0,000 059 196 350 34 
0,000 053 102 608 42 
0.000 047 715 387 27 
0,000 042 943 848 54 
0,000 038 709 947 97 
0,000 034 946 480 82 
0,000 031 595 448 40 
0,000 028 606 689 77 
0,000 025 936 732 06 
0,000 023 547 822 S 3 
0.000 021 407 111 39 
0,000 019 485 960 3 ft 
0,000 017 759 356 28 
0,000 (lift 205 412 61 
0,000 014 804 944 85 
0,000 013 541 108 10 
0,000 012 399 086 93 
0,000 011 365 829 69 
0,000 010 429 820 18 
0.000 009 580 881 333 
0,000 008 810 005 821 
0,000 008 109 209 905 
0,000 007 471 406 878 
0,000 006 890 297 453 
0,000 006 360 274 573 
0,000 005 876 340 639 
0,000 005 434 035 427 
0.000 005 029 373 215 
0,000 004 658 787 822 
0,000 004 319 084 546 
0,000 004 007 398 028 
0,000 003 721 155 314 
0,000 003 458 043 320 
0,000 003 215 980 288 
0,000 002 993 090 565 
0,000 002 787 682 389 
0,000 002 598 228 247 
0.000 002 423 347 499 
0,000 002 261 790 999 
0,000 002 112 427 443 
0,000 001 974 231 255 
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Tabu EZ 

The values of the binomial coefficients. 
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% 

(*) 

(*) 

CD 

0 

0 

X 

56 

1 540 

27 720 

367 290 

3 819 816 

32 468 436 

56 

57 

1 596 

29 260 

305 010 

4 187 106 

36 288 252 

57 

58 

1 653 

30 856 

424 270 

4 S82 116 

40 475 358 

58 

59 

1 711 

32 509 

455 126 

5 006 386 

45 057 474 

59 

60 

w 

34 220 

487 635 

5 461 512 

50 063 860 

60 

61 

II 

35 990 

521 855 

5 949 147 

55 525 372 

61 

62 

1 891 

37 820 

557 845 

6 471 002 

61 474 519 

62 

63 

1 953 

39 711 

505 665 

7 028 847 

67 945 521 

63 

64 


41 664 

635 376 

7 624 512 

74 974 368 

64 

65 

m 


677 040 

8 259 888 

82 598 880 

65 

66 

2 145 

45 760 

720 720 

8 936 928 

90 858 768 

66 

67 

2 211 

47 905 

766 480 

9 657 648 

99 795 696 

67 

68 

2 278 

50 116 

814 385 

10 424 128 

109 453 344 

68 

69 

2 346 

52 394 

864 501 

11 238 513 

119 877 472 

69 

70 

2 415 


oio 895 

12 103 014 

131 115 985 

70 

71 

2 485 

57 155 

971 635 

13 019 909 

143 218 990 

71 

72 

2 556 


1 028 790 

13 991 544 

156 238 908 

72 

73 

2 628 

62 196 

1 088 430 

15 020 334 

170 230 452 

73 

74 

2 701 

64 824 

1 150 626 

16 108 764 

185 250 786 

74 

75 

2 775 

67 525 

1 215 450 

17 259 390 

201 359 55(1 

75 

El 

2 850 


1 282 975 

18 474 840 

218 618 940 

wni 

km 

2 926 


1 353 275 

19 757 815 

237 093 780 


-K1 

■titu™ 

76 076 

1 426 425 

21 111 090 

256 851 595 


79 

3 081 

79 079 

1 502 501 

22 537 515 

277 962 685 


80 

3 m 

82 100 

1 581 580 

24 040 016 

300 500 200 

V 

81 

3 240 

85 320 

1 663 740 

25 621 5% 

324 540 216 

81 

82 

3 321 


1 749 060 

27 285 336 

350 161 812 

82 

83 

3 403 

91 881 

1 837 620 

29 034 396 

377 447 148 

83 

84 

3 486 

95 284 

1 929 501 

30 872 016 

406 481 544 ! 

84 

85 

3 570 


2 024 785 

32 801 517 

437 353 560 

85 

86 

3 655 


2 123 555 

34 826 302 

470 155 077 

86 

87 

3 741 

105 995 

2 225 895 

36 949 857 

504 981 379 

87 

88 

3 828 

100 736 

2 331 890 

39 175 752 

541 931 236 

88 

89 

3 910 

113 564 

2 441 626 

41 507 642 

581 106 988 

89 

90 


117 480 

2 555 190 

43 949 268 

622 614 630 

90 

91 

4 095 

121 485 

2 672 670 

46 504 458 

666 563 898 

91 

92 

4 186 

125 580 

2 794 155 

49 177 128 

713 068 356 

92 

93 

4 278 

129 766 

2 919 735 

51 971 283 

762 245 484 

93 

94 

4 371 


3 049 501 

54 891 018 

814 216 767 

94 

95 

4 405 

138 415 

3 183 545 

57 940 519 

869 107 785 

95 

96 


142 880 

3 321 960 

61 124 064 

927 048 304 

96 

97 

4 656 


3 464 840 

64 446 024 

988 172 368 

97 

98 

4 753 


3 612 280 

67 910 864 

1 052 618 392 

98 

99 

4 851 

156 849 

3 764 376 

71 523 144 

1 120 529 256 

99 

100 


■" 3 

3 921 225 

75 287 520 

1 192 052 400 

100 

101 


■ 

4 082 925 

79 208 745 

1 267 339 920 

101 

102 

5 151 


4 249 575 

83 291 670 

1 346 548 665 

mwmm 

103 

5 253 

170 851 

4 421 275 

87 541 245 

1 429 840 335 


104 

5 356 


4 598 126 

91 962 520 

1 517 381 580 

104 

105 

5 460 


4 780 230 

96 560 646 

1 609 344 100 

105 

106 

5 565 

■ 

4 967 690 

101 340 876 

1 705 904 746 

106 

107 

5 671 

198 485 

5 160 610 

106 308 566 

1 807 245 622 

107 

108 

5 778 


5 359 095 

111 469 176 

1 913 554 188 

108 

109 

5 886 


5 563 251 

116 828 271 

2 025 023 364 

109 

310 

5 995 

Bbi 

5 773 185 

122 391 522 

2 141 851 636 

110 
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The values of the binomial coefficients. 


(?) (?o> 




36 



9 




1 



120 



45 




10 



330 



165 




55 



792 



495 




220 


1 

716 


1 

287 




715 


3 

432 


3 

003 



3 

002 


6 

435 


6 

435 



5 

005 


11 

440 


12 

870 



11 

440 


19 

448 


24 

310 



24 

no 


31 

824 


43 

758 



48 

620 


50 

388 


75 

582 



92 

378 


77 

520 


125 

970 



167 

960 


116 

280 


203 

490 



293 

930 


170 

544 


319 

770 



497 

420 


245 

157 


490 

314 



817 

190 


346 

104 


735 

471 


1 

307 

>04 


480 

700 

1 

081 

575 


2 

042 

975 


657 

800 

1 

562 

275 


3 

124 

550 


888 

030 

2 

220 

075 


4 

686 

825 

1 

184 

040 

3 

108 

105 


6 

906 

900 

1 

560 

780 

4 

292 

145 


10 

015 

005 

2 

035 

800 

5 

852 

925 


14 

307 

150 

2 

629 

575 

7 

888 

725 


20 

160 

075 

3 

365 

856 

10 

518 

300 


28 

048 

800 

4 

272 

048 

13 

884 

156 


38 

567 

100 

5 

379 

616 

18 

156 

204 


52 

451 

256 

6 

724 

520 

23 

535 

820 


70 

607 

460 

8 

347 

680 

30 

260 

340 


94 

143 

280 

10 

295 

472 

38 

608 

020 


124 

403 

620 

12 

620 

256 

48 

903 

492 


163 

Oil 

640 

IS 

380 

937 

61 

523 

748 


211 

915 

132 

18 

643 

560 

76 

904 

685 


273 

438 

880 

22 

481 

940 

95 

548 

245 


350 

343 

565 

26 

978 

328 

118 

030 

185 


445 

891 

810 

32 

224 

114 

145 

008 

513 


563 

921 

995 

38 

320 

568 

177 

232 

627 


708 

930 

508 

45 

379 

620 

215 

553 

195 


886 

163 

135 

53 

524 

680 

260 

932 

815 

1 

101 

716 

330 

62 

891 

499 

314 

457 

495 

1 

362 

649 

145 

73 

629 

072 

377 

348 

994 

1 

677 

106 

640 

85 

900 

584 

450 

978 

066 

2 

054 

455 

634 

99 

884 

400 

536 

878 

650 

2 

505 

433 

700 

115 

775 

100 

636 

763 

050 

3 

042 

312 

350 

133 

784 

560 

752 

538 

150 

3 

679 

075 

400 

154 

143 

080 

886 

322 

710 

4 

431 

613 

550 

177 

100 

560 

1 040 

465 

790 

5 

317 

936 

260 

202 

927 

725 

1 217 

566 

350 

6 

358 

402 

050 


716 
646 646 
1 144 (166 
1 961 i.id 
3 208 760 

5 .111 755 
8 4(6 285 
15 125 110 
20 050 010 
10 045 015 

44 552 165 
(4 512 240 
92 561 040 
151 128 140 
185 579 596 

254 186 856 
348 3.(0 136 
472 733 756 
635 745 396 
847 660 528 

1 121 099 408 
1 471 442 973 

1 917 334 783 

2 481 256 778 

3 190 187 286 

4 076 350 421 

5 178 066 751 

6 540 715 896 
8 217 822 536 
10 272 278 170 

12 777 711 870 
15 820 024 220 
19 499 099 620 
23 930 713 170 
29 248 649 430 
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35 607 051 480! 

43 183 019 880 
52 179 482 355 
62 828 356 305 
75 394 027 566 

90 177 170 226 
107 518 933 731 
127 805 525 001 
151 473 214 816 
179 013 799 328 

210 980 549 208 
247 994 680 648 
290 752 384 208 
340 032 449 328 
396 704 524 216 

4ol 738 052 776 
536 211 932 2561 72 
621 324 937 376 73 

718 406 958 8411 74 
828 931 106 355 

954 526 728 530 76 

1 096 993 404 430 77 

1 258 315 963 905 78 

1 440 680 596 355 79 

I 646 492 110 120 80 

1 878 392 407 320 81 

2 139 280 241 670 82 

2 432 332 329 570 83 

2 761 025 887 620 84 

3 129 162 672 636 85 

3 540 894 603 246 86 

4 000 751 045 226 87 

4 513 667 845 896 88 

5 085 018 206 136 89 

5 720 645 481 903 90 

6 426 898 010 533 91 

7 210 666 060 598 92 

8 079 421 007 658 93 

9 041 256 841 903 94 

10 104 934 117 421 95 

II 279 926 456 656 96 

12 576 469 727 536 97 

14 005 614 014 756 98 

15 579 278 510 796 99 

17 310 309 456 440 100 

19 212 541 264 840 101 
21 300 860 967 540 102 
23 591 276 125 340 103 
26 100 986 351 440 104 
28 848 458 598 960 105 

31 853 506 369 685 106 
35 137 373 005 735 107 
38 722 819 230 810 108 
42 634 215 112 710 109 
46 897 636 623 981 110 
























































































































































































CONCERNING THE LIMITS OF A MEASURE 
OF SKEWNESS 

By RAYMOND GARVER 
University of California at Los Anycfcs 

In a recent note in the Annals of Mathematical Statistics,* 
Hotelling and Solomons devised an ingenious method of showing 
that the measure of skewness s defined by the equation 

■mean- median 

Cj = ————i——— 11 . . . 

standard dev/at ion 

cannot be greater than unity in absolute value. I am venturing to 
offer another proof of the same fact, which seems to me to be of 
interest because it employs an important and well-known algebraic 
inequality. 

With Hotelling and Solomons, 1 shall assume that we arc* con¬ 
cerned with 77 readings, or X > S > with median zero and mean X , 
where x of course is U, x/n We may show that the absolute 
value of s cannot be greater than one by showing that l/s z is 
not less than one. Making obvious substitutions, we must then 
show that 


”Zx± > , 
rz:*)* = 

Now according to a known theorem if <3,4 •■ •••, k arc n positive 
numbers, and if 777 is a number not lying between aero and 011c, 
then 

77 = ( 77 / 


♦Vol 3, no. 2, May, 1932, 141-2. 
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While the proof of this theorem is given in Chrystal, we shall 
outline a (simplified) proof for the case m=2, to make this note 
.self-contained. For any number r we obviously have (r-Jj z =0. 
\ T ow let r equal , nb/^ turn. 

The first of these gives 


_ £ na 

(&+b+-- + k)* (<3+b+ ■ +k) 


+ 1^0, 


while the others give similar inequalities. Summing these inequal¬ 
ities we have 


77 z (+ • • •• + k z ) 




k)* 


- 2 r?+ n £ O. 


which is ChrystaPs theorem * for rr?* 2. The proof shows that 
some of the numbers a,b } k can be zero; in fact, some can 
be negative, provided a + b+ ./ k is not zero. 

Now, suppose we have an odd number of readings, say v*£s+l. 
Since the median reading is zero, there are s non-negative read¬ 
ings, which we shall now call ys, and 5 non-positive readings, 
which we shall call %'$. We have at once, by the above, 

&Zy* * , 

Gy/* = ' 


sZzl > i 

(Zz) z = 7 ' 


It follows immediately that 


YZy z +ZB z )> 

( (Zy)*+CZz)*J = 


1 


♦Chrystal, Algebra, Part II, 2nd ed., 1922, p. 49, 
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3 <50 

and , since 77 « 2 s +1 that 


77 


fEy*+Zz 2 ) 
l(Zy)*+(Z*) z J 


>2 . 


Finally, 

zx* zyl±zjL _ 

71 (Ex)* ” (Zy+Zz) 2 ~ 77 (Ey) 2 + (Zz) 2 +2(Zy)(Ez) 


since Z(Zy)(Es) is certainly not positive. 

This proof is valid unless all the y's are zero or all the i?'s are 
zero. Suppose the latter is the case. Then 


n Zx z _ r>Ey 2 

fax) 2 " (ZyJ 2 


>z 



>2, 


If all the readings are zero our definition of s does not give a 
definite value. 

If n is even, not odd, the proof may be modified by properly 
defining the median. In this case we can show again that 


(Zx) z 


Z2, 


but the possibility of the equality cannot be ruled out. 





EDITORIAL. 


Trapezoidal Rule for Computing Seasonal Indices. 

The following method for computing seasonals is suggested by 
the Detroit Edison article on “A Mathematical Theory of Sea¬ 
sonal/’ that appeared in Vol. 1, No. 1 of the Annals. 

We shall likewise define “the seasonal index for any month as 
the ratio of the total of the variates for the month in question to 
the total that would have been experienced if neither accidental 
nor seasonal influences were present”, that is. the seasonal index 
for the i-th month is 


( 1 ) 



The numerator presents no difficulties: the obstacle is met in 
determining the denominator, since iff fa) is the unknown function 
that is the consequence of only trend and cycle influences. Accord¬ 
ing to accepted concepts the trend may be represented by some 
smoqth analytic function, the cycle is a smooth though not a 
mathematically periodic function—but the seasonal and residual 
influences may inject all sorts of disturbances into a time series. 
We shall make but two further assumptions,— 

(a) The smooth function y=l)ffa ), representing the combined 
effect of trend and cycle, may be approximated by the upper sides 
of a series of trapezoids as in figure (1). The area of each 
trapezoid is to equal the area under the function llffa) limited by 
the common ordinates. 

(b) Neither seasonal nor accidental influences affect annual 
totals. Thus we might assume that the seasonal activity in the 
production of coal does not affect the total coal mined within the 
year, but merely concentrates production within certain months 
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and compensates this with a corresponding under-production in 
others. Although accidental disturbances within the year would 
merely attribute production to one month rather than the next, we 
must admit that if one of these months is the last of one year, and 
the other the first of the following year, such an accidental fluctua¬ 
tion will affect the annual totals. Usually, however, such perturba¬ 
tions represent but a small percent of the monthly production, and 
a negligible part of the annual total. 
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Let us assume that we are dealing with 2n consecutive years: 
then by the above assumptions the area of the rt'th trapezoid 
equals the total production for the r?-zV?year, T^ . 

Designating the ordinates corresponding to the v~th trapezoid 
by u n ^f and u n , it follows since we are dealing with equal unit 
bases, 

T n “ U r,-I + u r> )t that is 


a n m 2T„ - U n-1 > so that 


' u i - 2T 1 -u 0 
u z m * 2T Z -2T, +u 0 

a, * 2T 3 ~u z *2T 3 -2Tz+2T,-u 0 


( 2 ) 




2T 4 ~u 3 * 2T^-2T 3 +2T z - ZT f +u 0 


U ZT}-f ^Zrhf U 2r,-2*2T Z77 _ 1 -2T Zrj _ z + • * * ' + 2T, ~O 0 

?2r} ~ 2T Zt ?- u 2n_f £T~ 2r7 ~ 2T Zn1 + +2T Z - 2Tf+a 0 

By elementary geometry, the area corresponding to the twelve 
months of the 7?-Myear are 



1 

*233 

-( 2 ? u n- 

1 * u nJ 


*266 

( Zla T7-, 

r * ^ ) 

"s 

1 

*203 


* 5 u n) 

Mi, 

i 

m 266 

(3“r,-1 

+ *fu„) 

M„ 

1 

~268 

C u r?-f * 

Z3u„) 


According to definition, E ft equals the sum of the values of 
Mf, one for each of the 2n years. Thus, 
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22 ~2B0 (23 u 0 + u f 

+23u, + u 2 
+23 u z +u 3 


+ & U *»-Z* U 2».1 

+23 u Zn _ f + u Zri ), 


that is 


<3) < 


1 

'296 


24(u 0 +u y 


•• y U 


277 . 


-*)+ 6*A 


£77 


■ a) 

O' 


and in precisely the same manner 

2 % "266 *4 («*+Uf * ■‘’ * “ivJ+Sfajin ' ^ ) 

^ L % = IsS 24 ^ U ° * u l*‘"* u Jtn. l)+5( a *n ' u o ) 


But by (2) 

U D * u i i nig + + u Z t) _ f “ £T y +2T 3 + 21 r s + ■ ■ • 

- 2 - 0 , 

where O designates the sum of the totals for the odd years. 
Again, 


“in’ -* T < 


-2E-20, 


E representing the corresponding sum for the even years. 
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We have finally that 

r < 23 O+E) 

(4) , 

S - 

% 

where 6 represents the common difference > which 

from equation (3) is seen to he 

m J&Q ^^ a zrt ~ ^ u o ) 


It should be observed that we have not imposed the condition 
that the long time trend is a straight line—no matter what the 
law of growth may be, the assumption that the trend-cycle func¬ 
tion, iffa), may be approximated within each year by a secant 
line is alone responsible for the equal differences 5 . 

As an illustration let us compute the seasonals for the theoretical 
series presented in the Detroit Edison article, and reproduced 
below. 

Table 1 

Theoreiical Series. 


1 

1904 

1905 

1906 

1907 

1908 

1909 

■ 


1662 


2030 

1242 

1714 

1 

■ 

1582 

1860 

1855 

1052 

1831 

1 

■ ■ 

1913 


2077 

1283 

1831 


■SB 

1976 

2027 

2088 

1210 

2077 


1343 

1892 

2122 

2043 

1203 

2143 

June 

1236 

1700 

1672 

2093 

1166 

2058 

July 

1254 

2092 

2041 

2060 

1240 

2320 

Aug. 

1702 

1757 

1846 

2163 

1334 

2413 

Sept. 

1457 

1906 

2102 

2262 

1279 

2502 

Oct. 

1564 

1899 

2304 

1946 

1364 

2643 

Nov. 

1596 

1611 

2303 

1475 

1341 

2378 

Dec. 

1836 

2163 


1153 

1564 

2595 

Total 

16061 

22153 

24407 

23245 ! 

15278 

26505 
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TABLE 1 Continued 



1910 

1911 

1912 

1913 

1914 

1915 

Jan. 

2392 

1933 

2052 

2554 

2041 

1687 

Feb. 

2514 

1746 

2061 

2330 

1780 

1532 

Mar. 

2417 

1895 

2267 

2554 

2194 

1906 

Apr. 

2830 

1865 

2490 

2800 

2037 

1796 

May 

2702 

2167 

2419 

2845 

2156 

2539 

June 

2475 

1732 

2571 

2696 

1730 

2674 

July 

2211 

1742 

2657 

2314 

1577 

2566 

Aug. 

2249 

2011 

2469 

2525 

1649 

2661 

Sept. 

2108 

1976 

2591 

2377 

1652 

2952 

Oct. 

2203 

2144 

2516 

2850 

1753 

3342 

Nov. 

1875 

2168 

2389 

2494 

1585 

3093 

Dec. 

1899 

2092 

2435 

2150 

1779 

3182 

Total 

27875 

23471 

28917 

30489 

21933 

29930 


E = 157 793 
0~ 134 471 
El!7 r 22 509. 90 
6 * 296.14 

Column 1 of table 2 is obtained by adding the items of table 1 
horizontally. Column 2 is found by repeated adding the common 
difference, & - 296.14 to the value 22 559.90. 


Table 2 

Seasonals by Trapezoidal Rule. 


Month 

Vi 

z<P; 

S 

Jan. 

22 121 

22 560 1 

.981 

Feb. 

20 957 

22 856 

.917 

Mar. 

23 527 

23 152 

1.016 

Apr. 

24 411 

23 448 

1.041 

May 

25 574 

23 744 

1.077 

June 

23 803 

24 041 

.990 

July 

24 074 | 

24 337 

1.015 

Aug. 

24 779 

24 633 

1.006 

Sept. 

25 164 

24 929 

1.009 

Oct, 

26 528 

25 225 

1.052 

Nov. 

24 308 

25 521 

.952 

Dec. 

25 018 

25 817 

.969 

Total 

290 264 

290 263 

12.025 
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The following table presents the mean and standard errors for 
the results obtained by Link-relative method, the Interpolation 
method suggested in the Detroit Edison article, and the Trape¬ 
zoidal method. Obviously, the last requires by far the least time 
in application. ' 


Table 3 


Method 

M.D 


Link Relative 

.0277 

.0338 

Interpolation 

.0269 

.0337 

Trapezoidal 

0227 

.0255 


For weekly indices the formulae by the trapezoidal rule are 




If one has data for Zrt+1 years, he may either disregard the 
most distant year and then compute seasonals for the final 
years, or he may combine the results for the first 2n years (neg¬ 
lecting the last year) and the results for the last 2n years (neg¬ 
lecting the first year). This yields an almost equally simple 
formula. 

Applying the Trapezoidal rule to successive overlapping periods 
will reveal the presence of a shifting seasonal. The change in the 
seasonal for Automobile production, caused by the advent of good 
roads and closed models, affords a good example. 
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